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Preface 


Objectives 


My objective for this text is to present the principles of statistical design and 
analysis for comparative scientific studies to graduate students in the experimental 
sciences and applied statistics. For this second edition, the primary title Design of 
Experiments was added to make the text more identifiable while leaving the 
original title to portray the text’s objective. Statistical Principles of Research 
Design and Analysis describes the philosophy that successful comparative 
experiments have clearly defined objectives that can be addressed by appropriate 
choices of treatment designs. In addition, the experiment design chosen to accom- 
modate the treatment design should provide the most efficient design in the context 
of the experiment and resources at the scientist’s disposal. I call this process 
research design—the total effort in a study that includes development of the re- 
search hypothesis, the choice of treatment design to address the research hypothe- 
sis, and the experiment design choice to facilitate efficient data collection. 


Orientation and Background 


The text is applications oriented, using the results of established theory, and does 
not include theoretical development. It has a classical design orientation that is in- 
tended to introduce to the student the fundamental principles and designs. These 
fundamentals constitute the necessary ingredients upon which statistical researchers 
and practitioners continually innovate, enhance, and expand upon in the develop- 
ment of useful and efficient design strategies for research. The text presumes rea- 
sonable facility with college algebra and a prerequisite introduction to statistical 
methods, including the basic ideas of random sampling, basic probability laws, con- 
fidence intervals, ¢ tests, analysis of variance, regression, and F' tests. 
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Examples and Exercises 


The examples and exercises used throughout the text are based on actual research 
studies whenever possible. Approximately one-half of the examples and exercises 
with data sets involve applications to various life and agricultural sciences, while 
the remaining half involve various applications to engineering, industrial, and 
chemistry research. | 


Structure and Coverage 


The first chapter emphasizes the role statistical science plays in the research study 
planning stages. Chapters 2 through 7 emphasize the association between research 
hypotheses and treatment designs and their analysis in the simplest setting of com- 
pletely randomized designs. Chapters 8 through 17 present developments of experi- 
ment designs and their analyses from the simplest blocking structures in the earlier 
chapters to more complex and specialized blocking designs and analyses in the later 
chapters. Care has been taken to preserve the research design paradigm with exam- 
ples drawn from a variety of sciences to illustrate the major topics in chapters. The 
reader is provided with a consistent reminder of the connection between the re- 
search objectives and the statistical design and analysis to meet those objectives. 
Interpretations are provided for the analyses. 

The book purposely contains more material than is necessary for a one-term 
course, primarily because not all instructors have preferences for the same material 
coverage. Courses emphasizing experiment designs, analysis of treatment designs, 
or combinations thereof can be developed with appropriate choice of chapters. 


New to the Second Edition | 


Chapter | is the keystone for everything that follows in the text since it establishes 
the relationship between research hypotheses and treatment designs, replication, 
randomization, and local control practices to set the stage for later chapters. In this 
edition, proper credits are given to Sir Ronald A. Fisher and his seminal paper in 
1926, which consolidated his unified concepts for valid inference from designed 
experiments. A case study has been added that illustrates the application of design 
principles discussed in the previous sections of this chapter. 

New or upgraded discussions, clarification statements (hopefully), as well as 
updated references to many topics have been added in various and sundry places 
throughout this edition. Some cosmetic changes were made to improve the overall 
appearance, including a generic analysis of variance table format in place of com- 
puter output. Also removed were illustrations with data for hand calculator compu- 
tation of the basic analysis of variance sums of squares. 

Computer programs constantly evolve in terms of what they do, the user 
interface, and how they display output. An abundance of commercial computer pro- 
grams are available to help design experiments for complete and incomplete blocks, 
fractional factorials, mixtures, response surfaces, and so forth, including the ran- 
domization schemes. Programs for analysis are even more numerous. The 
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attachment to a particular software package seems to be very personal with users 
and it is an interface that is better left alone in a textbook of this nature. 

Pointers to advanced methodology for analysis choices have been added, 
methods often based on maximum likelihood, such as those for variance com- 
ponent estimation in Chapter 7, mixed model software for split plots in Chapter 14, 
and choices for covariance matrices for repeated measure in Chapter 15. A snippet 
on the generalized linear model was added in Chapter 4 to let the student know that 
we have models and method that do not require the normal distribution assumption. 
It includes reference to several of the good generalized model analysis books spe- 
cifically designed for users of the generalized model. 

Simultaneous statistical inference has been upgraded in Chapter 3 with discus- 
sion on different strengths of inference available to the experimenter for post analy- 
sis of variance methods. Confidence interval level of inference with simultaneous 
confidence intervals is the preferred choice of inference for multiple comparisons 
followed by the confident inequalities option that was used in the first edition. 
Some references are provided in Chapter 3 to other tests that require software (now 
being implemented in some packages) to compute critical test values but which 
provide much better alternatives to some of the old standbys. 

Spread-location plots were added to Chapter 4 to aid evaluation of homogene- 
ous variance assumptions, and the residual-fitted spread plots were added to evalu- 
ate adequacy of the fitted model. The unrestricted mixed model is now the model 
choice for the designs in Chapter 7, if for no other reason than that it extends natu- 
rally from balanced to unbalanced designs. It is perhaps a matter of trivial concern 
to some, but it is a subject that is fairly evenly divided between the “restricted” and 
the “unrestricted” model camps. 

The newer Latinized resolvable incomplete block designs have been added to 
the cyclical design section in Chapter 10. They offer another dimension of blocking 
control for the a-designs while they preserve the orthogonality for that blocking 
criterion. Several pages on the general Taguchi method philosophy have been 
added to Chapter 12. Although the Taguchi method is very specialized to the manu- 
facturing research community, it does have some design and analysis components 
that should be acknowledged. An example with heterogeneous slopes among treat- 
ment groups for the covariate has been added in Chapter 17 to expand the discus- 
sion on analysis of covariance assumptions. 
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1 Research Design Principles 


All of our activities associated with planning and performing research studies have 
statistical implications. The principles we encounter in Chapter 1 form a basis for 
the structure of a research study; the structure, in turn, defines the study's function. 
lf the structure is sound the study will function properly and produce the information 
it was designed to provide. If the structure is faulty the study will not function : 
properly, and it will produce either incomplete or misleading information. The 
statistical principles are those associated with the collection of observations to gain 
‘maximum information for a research study in an efficient manner. They include 
treatment design, local control of variability, replication, randomization, and the effi- 
ciency of experiments. 


1.1 The Legacy of Sir Ronald A. Fisher 


No one person had as much impact on the statistical principles of experimental 
design in his time as Ronald A. Fisher. In October 1919, Fisher joined the staff at 
Rothamsted Experimental Station near Harpenden, England. He had been asked to 
come for a period of six months to one year to apply a thorough statistical analysis 
to agricultural research data that had been accumulated by the staff. 

It was during his tenure at Rothamsted, where he remained until 1933, that he 
developed and consolidated basic principles of design and analysis that we today 
take as necessary practices for valid research results. From 1919 to 1925, he studied 
and analyzed wheat experiments on Rothamstead Experimental Station that had 
been conducted since 1843. From his statistical investigations of these experiments 
and others, Fisher developed the analysis of variance and unified his ideas on basic 
principles of experimental design. 

‘In 1926 he published the first full account of his ideas in a paper, “The Ar- 
rangement of Field Experiments” (Fisher, 1926). In that seminal paper he outlined 
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and advanced three fundamental components for experiments in agricultural field 
trials: local control of field conditions for the reduction of experimental error, 
replication as a means to estimate experimental error variance, and randomization 
for valid estimation of experimental error variance. Although replication and local 
control were practiced at the time, his justifications with regards to experimental 
error variance were relatively new concepts. Randomization was a radical new 
concept met with skepticism and resistance by his contemporaries who, for the 
most part, did not understand its statistical implications. Each of these concepts is 
discussed in more detail in succeeding sections of this chapter. 

Two books that grew out of Fisher’s experiences at Rothamsted became stan- 
dard references for researchers on the design and analysis experimental studies. 
Statistical Principles for Research Workers was first published in 1925 (Fisher, 
1925) with 13 subsequent editions, and The Design of Experiments was published 
ten years later (Fisher, 1935) with 7 subsequent editions. His contributions to 
experimental design were but a few of his contributions to the scienee of statistics. 
A biography on the life and times of Fisher, written by one of his daughters (Box, 
1978), pays homage to him as the consummate scientist. 


1.2 Planning for Research 


A research program is an organized effort on the part of a scientist to acquire 
knowledge about a natural or manufactured process. The total program may require 
many individual studies, each with specific objectives. The individual studies usu- 
ally answer related questions and provide related pieces of information, which in 
concert meet the goals of the program. The design and analysis of the individual 
research study are the object of our attention in this book. 

Good planning helps the scientist to organize the required tasks for a research 
study. The individual study requires the scientist to make a number of critical deci- 
sions. Consider the nutrition scientist who wanted to improve the standard method 
for evaluating nutritional quality of different protein sources. Although the accepta- 
ble standard procedure for evaluation was fairly rigidly defined by peer scientists, 
he hypothesized that the substitution of mice for rats as the standard test animal was 
more time- and cost-efficient. A test of his research hypothesis required a study to 
determine whether the mice were more efficient. Among the critical decisions he 
had to make were the number of mice and rats to use; the amount of protein to use 
in the diets; the various sources of protein necessary to validate the new protocol: 
the length of time to run the study; and the number of replications for the full 
experiment. 


Documented Plans Prevent Oversights 


The importance of developing a written plan cannot be overemphasized. Frequent 
reference to an existing document prevents serious oversights. The document also 
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will be useful for subsequent insertions of notes and any alterations relating to the 
specific items in the original plan. 

The astute. investigator develops a checklist of specific considerations at the 
beginning of a study. Some typical items that a checklist can address are 


the specific objectives of the experiment 


e identification of influential factors and which of those factors to vary and 
which to hold constant 


e the characteristics to be measured 
e the specific procedures for conducting tests or measuring the characteristics 
e the number of repetitions of the basic experiment to conduct 


e available resources and materials 


Bicking (1954) presents a detailed checklist for planning a research study that 
can be consulted as a guide to develop a written plan. 


Simple Questions to Focus Activities 


Simple, but challenging, questions aid the design process, even though we may 
have a well-defined research hypothesis as an impetus for the research study. 

Questions that focus our attention throughout the design process include “What 
is my objective?” “What do I want to know?” and “Why do I want to know it?” 
Productive follow-up questions for each activity in the process—such as “How am 
I going to perform this task?” and “Why am I doing this task?”—direct our 
attention to define the role of each activity in the research study. 

Components of the research study are discussed separately in the following 
sections, but they are interconnected and an investigator must integrate those sepa- 
rate parts into an effective research study. We begin by establishing a small vocab- 
ulary to communicate our ideas. 


1.3 Experiments, Treatments, and Experimental Units 


Accurate communication requires that both parties respond to a common vocabu- 
lary with a common meaning. This section establishes the interpretation of some 
common terms and concepts as they are applied to scientific research studies. 

For our purposes, an experiment shall be confined to investigations that estab- 
lish a particular set of circumstances under a specified protocol to observe and 
evaluate implications of the resulting observations. The investigator establishes and 
controls the protocols in an experiment to evaluate and test something that for the 
most part is unknown up to that time. 

The comparative experiment is the type of experiment familiar to investiga- 
tors in the fields of biology, medicine, agriculture, engineering, psychology, and 
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other experimental sciences. The adjective comparative implies the establishment 

of more than one set of circumstances in the experiment, and that responses 

resulting from the differing circumstances will be compared with one another. 
Treatments are the set of circumstances created for the experiment in response 


to research hypotheses, and they are the focus of the investigation. Examples of 


treatments are animal diets, cultivars of a crop species, temperatures, soil types, and 
amounts of a nutrient. Two or more treatments are used in a comparative study, and 
they are compared with one another for their effects on the subjects of the study. 

The experimental unit is the physical entity or subject exposed to the treat- 
ment independently of other units. The experimental unit, upon exposure to the 
treatment, constitutes a single replication of the treatment. 

‘xperimental error describes the variation among identically and indepen- 
dently treated experimental units. The various origins of experimental error include 
(1) the natural variation among experimental units; (2) variability in measurement 
of the response; (3) inability to reproduce the treatment conditions exactly from one 
unit to another; (4) interaction of treatments and experimental units; and (5) any 
other extraneous factors that influence the measured characteristics. 

A beef cattle feeding trial provides an example of natural variation among 
experimental units. Two cattle of the same breed and herd receive the same amount 
of a diet; yet one steer gains 2.0 pounds per day, and the other gains 2.3 pounds per 
day over a one-month period. 

The inability to reproduce treatment conditions exactly occurs when replicate 
test tubes are prepared independently, containing the same mixture of compounds, 
and the resultant chemical product is weighed and found to differ in each of the 
tubes by 0.1 yg. Weighing or pipetting processes are not exact; therefore, a small 
amount of variation is introduced at the treatment preparation stage. 

A major objective in statistical galculation:s is the attainment of an estimate for 
the variance of experimental error. In its simplest form, the experimental error 
variance is the variance of observations on experimental units for which the 
differences among the observations can be attributed only to experimental error. 
Many of the statistical procedures we use require an estimate of this variance. 
Examples include confidence interval estimates for a mean and the two-sample 
Student ¢ tests for the hypothesis of no differences between the means of two 
treatment populations. 

Comparative observational studies are those studies for which we would like 


= A 


to conduct an experiment but cannot do so for practical or ethical reasons. The 
investigator has in mind conditions or treatments that have causal effects on 
subjects for which experiments cannot be conducted to elicit responses. 

Investigators in the social sciences, ecology, wildlife, fisheries, and other 
natural resource sciences often must conduct observational studies in lieu of direct 
experimentation. The basic unit of study in the investigation may be human 
subjects, individual animals, habitats, or other microcosms; they have the same role 
as the experimental unit in the designed experimental study. 

The subjects are either self-selected into identifiable groups or they simply 
exist in their particular circumstances. The groups or circumstances are used as 
treatment classifications in the observational study. By contrast, the investigator 


1.4 RESEARCH HYPOTHESES GENERATE TREATMENT DESIGNS 5 


assigns treatments to the experimental units in a designed experiment. For example, 
to study nitrification in soils from pure and mixed stands of mature pine and oak 
trees, a soil scientist selects existing pure and mixed stands of the two species and 
collects the necessary observations from the selected sites. A true experiment is 
quite impractical in this case because it requires an extraordinary expenditure of 
time to establish mature stands of the trees. 

Ethical considerations sometimes prevent the use of experiments in lieu of 
observational studies. Consider a study to compare the severity of automobile 
accident injuries with and without the use of seat belts. It would be clearly unethi- 
cal to assign anyone randomly to a “seat belt” or a “no seat belt” treatment and then 
collide the automobile into a concrete wall nor would anyone agree to this. Rather, 
investigators would rely on injury data collected from accidents and compare the 
“seat belt” data with the “no seat belt” data. 

The nature of the scientific inference is the primary difference between the de- 
signed experiment and the observational study. With the designed experiment it is 
often possible to assign causal relationships between the responses and the 
treatments. Observational studies are limited to association relationships between 
the responses and treatment conditions. 


1.4 Research Hypotheses Generate Treatment Designs 


The research hypothesis establishes a set of circumstances and the consequences 
that follow from those circumstances. The treatments are a creation of the circum- 
stances for the experiment. Thus, it is important to identify the treatments relative 
to the role they each have in the evaluation of the research hypothesis. A failure to 
clearly delineate the research hypothesis and objective of the study can lead to dif- 
ficulties in the choice of treatments and to unsuccessful experiments. 


The Relationship Between Treatments and Hypotheses 


When treatments are chosen properly in response to a research hypothesis the 
underlying mechanisms may be better understood, whether they be physical, chem- 
ical, biological, or social. In some cases the objective may be to “pick the winner” 
to find one treatment that provides the desired response. In other cases, the experi- 
ment is used to elucidate underlying mechanisms associated with the treatments as 
they affect measured response variables. In the latter case sound research hypothe- 
ses motivate the selection of treatments. 

It is incumbent on the investigator to ensure that the choice of treatments is 
consistent with the research hypothesis. It may be sufficient and less difficult to 
design a study solely to discover the best treatment. However, with a little extra ef- 
fort even more fundamental information may be derived from the experiment in re- 
sponse to research hypotheses. 
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The following illustrate treatments used in actual research settings generated 
by research hypotheses: 


e The drinking kinetics of honeybees was studied under different ambient 
temperatures to address the hypothesis that the energy required by 
honeybees to retrieve food for the colony was dependent on the ambient air 
temperature. 


e The survival of Euphorbia seedlings under attack by a soil pathogen was 
determined for different types of fungicide treatment to address the hy- 
pothesis that not all fungicides were equally effective in controlling the soil 
pathogen. 


e In traffic engineering, several methods of measuring traffic delay at an in- 
tersection were evaluated under different types of traffic signal configura- 
tions to address the hypothesis that the method of measuring delay was de- 
pendent on the type of configuration used for signaling traffic at an 
intersection. 


e The development of social competence in young children was measured for 
its relationship to (1) parent education, (2) parent income, (3) family 
structure, and (4) age of child to address a complex research hypothesis that 
certain family demographics favorably affect the development of a child. 


Note that in some research settings the treatments are conditions imposed by 
the investigator, such as those involving the honeybees and the survival of seed- 
lings. On the other hand, in the traffic engineering and child development studies 
the treatments were those of existing conditions. Whether the investigator is per- 
forming a designed experiment or an observational study, he or she has the task of 
selecting the proper treatments to address the research hypotheses. 

Frequently, additional treatments are required to fully evaluate the conse- 
quences of the hypotheses. An important component of many treatment designs is 
the control treatment discussed in the following section. 


Control Treatments Are Benchmarks 


The control treatment is a necessary benchmark treatment to evaluate the effec- 
tiveness of experimental treatments. There are several circumstances in which a 
control treatment is useful and necessary. 

Conditions of the experiment may disallow the effectiveness of the experi- 
mental treatments that are known generally to be effective. A control of no treat- 
ment will reveal the conditions under which the experiment was conducted. For 
example, nitrogenous fertilizers are generally effective but will fail to produce re- 
sponses in fields with high fertility. A control of no nitrogenous fertilizer will re- 
veal the base fertility conditions for the experiment. 


1.4 RESEARCH HYPOTHESES GENERATE TREATMENT DESIGNS 7 


Sometimes treatments require manipuiating the experimental units or subjects 
where the manipulation alone can produce a response. Placebo controls establish a 
basis for treatment effectiveness in these cases. The placebo unit or subject is proc- 
essed just as the treatment units, but the active treatment is not included in their 
protocol. One of the most famous health experiments, the 1954 field trial of the 
Salk poliomyelitis vaccine, utilized placebo controls in approximately one-half of 
the test areas in the United States. The placebo was prepared to look just like the 
vaccine, but without the antigenic activity of the poliomyelitis vaccine. The placebo 
subjects were inoculated in the same fashion as were the subjects receiving the 
vaccine (Tanur et al., 1978). 

Finally, the control may represent a standard practice to which the experi- 
mental method may be compared. In some situations it is necessary to include two 
distinct types of controls. For example, the no treatment and the placebo treatment 
can reveal the effect of manipulating the experimental unit in the absence of any 
treatment. 


Multiple-Factor Treatment Designs Expand Inferences 


In “The Arrangement of Field Experiments,” Fisher (1926) noted that no proverb in 
connection with field experiments at that time was more repeated than that which 
said, 


We must ask Nature few questions, and ideally, one at a time. 


He was convinced this was a mistaken view. In this regard, he wrote, “Nature... 
will best respond to a logical and carefully thought out questionnaire; indeed, if we 
ask her a single question, she will often refuse to answer until some other topic has 
been discussed.” 

Fisher understood that in natural systems we don’t know whether one treat- 
ment’s influence is independent of others or its influence is related to variation in 
the other treatments. Consequently, the conditions under which treatments are com- 
pared can be an important aspect of the design. 

For example, in a study on nitrogen production by Rhizobium bacteria in soil a 
comparison of interest was the amount of nitrogen produced in normal, saline, and 
sodic soils. However, the nitrogen production was known to be affected by the tem- 
perature and moisture conditions in the soil. In fact, the optimum conditions of tem- 
perature and moisture for nitrogen production may well have been different for 
each soil type. Consequently, the experiment was set up to test the nitrogen pro- 
duction at several temperatures in combination with several moisture conditions for 
each of the soils. Treatment designs of this type are known as factorial treatment 
designs, wherein one set of treatments, say soils, is tested over one or more other 
sets of treatments, such as moisture and temperature. 

A factor is a particular group of treatments. Temperature, moisture, and soil 
type are each considered a factor. The several categories of each factor are termed 
levels of the factor. Temperature levels are 20°C, 30°C, and 40°C; while soil type 
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levels are normal, saline, and sodic. A quantitative factor has levels associated with 
ordered points on some metrical scale such as temperature. The levels of a 
qualitative factor represent distinct categories or classifications, such as soil type, 
that cannot be arranged in order of magnitude. 

The factorial arrangement consists of all possible combinations of the levels of 
the treatment factors. For example, the factorial arrangement for three levels of 
temperature with three levels of soil type consists of 3 x 3 = 9 factorial treatment 
combinations. The nine combinations are 


(20°C, normal) (30°C, normal) (40°C, normal) 
(20°C, saline) (30°C, saline) (40°C, saline) 
(20°C, sodic) (30°C, sodic) (40°C, sodic) 


With this arrangement the investigator was able to evaluate the nitrogen pro- 
duction in each of the soil types and also to determine the influence of temperature 
conditions on the relative production by soil type. In addition, it was also possible 
to evaluate separately the influence of temperature on nitrogen production. The sta- 
tistical analysis of factorial arrangements is discussed in Chapters 6 and 7. 


1.5 Local Control of Experimental Errors 


Precise and accurate comparisons among treatments over an appropriate range of 
conditions are the primary objectives of most experiments. These objectives require 
precise estimates of means and powerful statistical tests. Reduced experimental er- 
ror variance increases the possibility of achieving these objectives. Local control 
describes the actions an investigator employs to reduce or control experimental 
error, increase accuracy of observations, and establish the inference base of a study. 

The investigator controls (1) technique, (2) selection of the experimental units, 
(3) blocking or ensuring parity of information on all treatments, (4) choice of ex- 
periment design, and (5) measurement of covariates. Each of these is discussed in 
this section in more detail. 


Technique Affects Variation and Bias 


If experimental tasks are performed in a slipshod manner the observations will ex- 
hibit an increase in variation. Technique includes simple tasks such as accurate 
measurement, preparation of media, pipetting of solutions, or calibration of instru- 
ments. On a more complex level the investigator may have several alternative 
laboratory methods or instruments for measuring chemical or physical properties. 
The methods may vary in their accuracy, precision, and range of application. The 
researcher must choose the method or instrument that provides the most accurate 
and precise set of observations within the budgeted resources. 

Vhen technique adversely affects precision the estimated experimental error 
variances are unnecessarily inflated. Outlying observations caused by recording er- 
rors or extraordinary environmental conditions can increase variation. Whatever the 
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cause the investigator has to decide whether to include these observations in the 
analysis. | 

On the other hand, no discernible pattern may exist in the observations other 
than a general increase in their variability. This type of increased variation could 
point to faulty technique somewhere in the course of the experiment. The investi- 
gator would need to check out the selection of experimental units for the study, 
treatment protocol, measurement techniques, and personnel for sources of increased 
error and then attempt to make adjustments wherever necessary. 

Poor techniques can affect the accuracy of observations, introducing bias into 
the results. Further, the variation introduced into the observations by poor tech- 
nique is not necessarily random and, therefore, not subject to the same probability 
laws we associate with statistical inference. 

Uniform application of treatments throughout the experiment increases the 
likelihood of unbiased measurement of the treatment effects. For instance, uniform 
amounts of food intake are required for accurate measures of differences between 
diets for animals. Uniform amounts of fertilizers applied to the replicate plots are 
required for accurate measures of crop yield differences in fertilizer trials. 


Selection of Uniform Experimental Units 


Heterogeneous experimental units produce large values for the experimental error 
variance. Precise comparisons among the treatments require the selection of uni- 
form experimental units to reduce experimental error. However, an excessively 
stringent selection of experimental units can produce artificially uniform condi- 
tions. The narrow set of conditions restricts the inference base of the study. The 
fore, to have reasonable confidence in the conclusions secured from the 
experiment, it is desirable to have units represent a sufficient range of conditions 
without unnecessarily increasing the heterogeneity of the experimental units. 

The nature of the experiment dictates the balance between range of conditions 
and uniformity of units. For example, plant selections in a plant-breeding study 
should be tested over the range of conditions in which future varieties are expected 
to be cultivated; hence the range of conditions may be quite wide. If the selections 
are tested in several widely separated locations, then uniformity within a location 
becomes important. Selecting a uniform set of plots controls the variation within a 
location. Uniformity of units in an experiment with dairy cows requires selection of 
cows from the same breed, at the same stage of lactation, and with similar numbers 
of previous lactations. 


Blocking to Reduce Experimental Error Variation 


Fisher (1926) argued that no advantage had to be given up to have a valid estimate 
of error but two things were necessary: 


(a) that a sharp distinction should be drawn between those components of error which 
are to be eliminated in the field, and those which are not to be eliminated... (b) that 
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the statistical process of the estimation of error shall be modified so as to take account 
of the field arrangement, and so that the components of error actually eliminated in 
the field shall equally be eliminated in the statistical laboratory. 


Blocking provides local control of the environment to reduce experimental er- 
ror. The experimental units are grouped such that the variability of units within the 
groups is less than that among all units prior to grouping. The practice of blocking 
or grouping experimental units into homogeneous sets goes hand in hand with ex- 
perimental unit selection for uniformity. Treatments are compared with one anoth- 
er within the groups of units in a more uniform environment, and the differences 
between the treatments are not confused with large differences between experimen- 
tal units. The variability associated with environmental differences among groups 
of units can be separated from experimental error in the statistical analysis. 

Experimental units are blocked into groups of similar units on the basis of a 
factor or factors that are expected or known to have some relationship with the re- 
sponse variable or the measurement that is hypothesized to respond differentially to 
the several treatments. 


Four Major Criteria for Blocking 


Four major criteria frequently used to block experimental units are (1) proxim- 
ity (neighboring field plots), (2) physical characteristics (age or weight), (3) time, 
and (4) management of tasks in the experiment. 

The classical blocking practice had its origins in agricultural field experiments 
when contiguous plots were placed into one group, and each of the treatments was 
assigned to a plot in that group. Then a second set of contiguous plots was used in 
the same fashion, and so forth, leading to a complete block design. 

The rationale for this type of grouping is that plots near one another are more 
alike than plots separated by greater distance in cultivated fields. Blocking patterns 
in agricultural trials may not necessarily be nice rectangular configurations. Exist 
ing variability patterns can require rather different blocking arrangements to sernuidl 
the experimental error. 

Another natural blocking unit is the animal litter. The size of litters in some 
animal species allows several treatments to be accommodated by one litter. Body 
weight is used to advantage as a blocking factor in animal experiments if variation 
in body weight amplifies variation in the response variable. | 

Industrial experiments require homogeneous batches of raw materials. A repli- 
cated experiment may require more raw material than that provided by a single 
batch, and variation from batch to batch may increase experimental error. A single 
batch sufficiently large for one replication of all treatments can serve as a blocking 
unit. 

Blocking is used to divide the experiment into reasonably sized units for uni- 
form management of time or tasks. Days serve as convenient blocking units if only 
one replication of treatments can be harvested in the field or processed in the 
laboratory in a single day. 
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Individual technicians can serve as blocking units to avoid confusion of ob- 
server or technician variability with that of the treatments. Each person can be as- 
signed to one replicate of each treatment when several people are available to 
record data or perform the laboratory analyses. 


A Demonstration of Variance Reduction by Blocking 


A uniformity trial best illustrates the potential effectiveness blocking may 
have for variance reduction in a research study. The uniformity trial is essentially 
an experiment in which the experimental units are measured but have not been sub- 
jected to any treatment. For example, the classic uniformity trial in agriculture may 
have been a field of wheat all of the same variety divided into plots all of the same 
dimension. The wheat yields were then measured on each plot. Because variation in 
agricultural fields can generally occur on gradients, it was possible to determine 
which groupings of adjacent field plots led to groups of plots with the smallest vari- 
ance within the groups. In following years’ experiments the treatments could be 
allocated within the groups of similar plots based on the results from the uniformity 
trial. 

Similarly, baseline observation prior to treatment application on the measure- 
ment of interest, or some variable that is known to have a strong relationship to the 
measurement of interest, is equivalent to a uniformity trial for blocking purposes. 
For example, measurements such as weights, age, chemical composition, or choles- 
terol levels prior to treatment administration may suitably be used as blocking 
criteria if they have strong relationships to the measurement of interest or are 
themselves the measurement of interest. 

Suppose we have observations on ten units from either a uniformity trial or 
measurements prior to treatment administration: 43, 72, 46, 66, 49, 68, 50, 76, 42, 
and 69. The mean and variance for the observations on these ten units are 7 = 58 
and s* = 175. Grouping the units into two blocks based on the size of those mea- 
surements, we have 


Block: 1: «.43;,.46,;.49;50;42. © = 46-:s?:= 12.5 
Block 2:):-72;66,68, 16:69. = 70 s* = 15.2 


Whereas the total variance among the ten units was equal to 175, the variances 
within the separate blocks have been reduced to 12.5 and 15.2, respectively. The 
component of error eliminated by blocking will be reflected in the variance be- 
tween the two block means, 46 and 70. The variability within each of the blocks is 
now assumed to represent the natural variation that exists among the relatively uni- 
form experimental units unencumbered by controllable environmental differences. 
Likewise, the comparisons among treatments within those blocks will not be influ- 
enced by those same controllable environmental differences. 
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Matching Strategies to Group Units 


Grouping of units often utilizes strategies to match similar units. Subjects or 
units are chosen for treatment groups on the basis of equality with respect to all 
influential factors. Any variables thought to have an influence on the value of the 
observed characteristics of the subjects are candidates for controlling variables. 
Matched subjects have common values for the controlling variables with the excep- 
tion of the treatment. 

The matching strategies at the study design stage attempt to achieve compara- 
bility of subjects on all factors that could unduly bias the comparison of the treat- 
ments. Pair matching and nonpair matching are two general strategies employed in 
the selection of units or subjects. 

With pair matching, a subject from each treatment is paired with subjects hav- 
ing the same controlling variable values in each of the other treatment groups. The 
values of the controlling variables may be chosen to be (1) exact value matches or 
(2) caliper value matches. Exact matches for human subjects are possible with vari- 
ables such as gender, profession, use of seat belt, and education level. Exact match- 
ing of roadway segments in a study of traffic performance is possible with variables 
such as number of lanes, lane width, presence or absence of a median, and speed 
limit. 

The caliper match allows a certain tolerance in the value of the matching varia- 
bles. A study on manipulation of a forest ecosystem may require study sites to be 
matched by tree species composition, slope, and aspect. Exact site matches on com- 
position, slope, and aspect are quite difficult to attain. However, it may be possible 
to obtain sites with similar values for any or all of the matching variables to the ex- 
tent that the variation will not have a serious effect on treatment comparisons. 

Nonpair matching may be accomplished through (1) a frequency or (2) a mean 
strategy. The frequency method stratifies the units into groups on the basis of the 
controlling variables. Suppose age is a potential influential variable in a study in- 
volving human subjects. The subjects can be stratified from a frequency distribu- 
tion of their ages such that there is a sufficient number of subjects in each age 
stratum to accommodate all treatment groups. 

The mean-based nonpair strategy groups subjects or units in such a way that 
the average values of the controlling variables are the same in each of the treatment 
groups. Experimental animals can be grouped such that their average weight is the 
same for all treatment groups. 

The nature of the research study dictates the most effective matching strategy 
and whether matching is a desirable protocol. Details on matching methods, includ- 
ing advantages and disadvantages, for comparative observational studies can be 
found in Cochran (1983), Kish (1987), and Fleiss (1981). 


Experiment Designs Accommodate Treatment Designs 


The experiment design is the arrangement of experimental units used to control 
experimental error and, at the same time, accommodate the treatment design in an 
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experiment. A wide variety of experiment design arrangements to control experi- 
mental error exists in the published literature, and there is a natural tendency to de- 
sign an experiment that conforms to the existing designs. But, a more appropriate 
attitude is to develop an experiment design that satisfies the demands of the current 
experiment. 

The attainment of maximum information, precision, and accuracy in the re- 
sults, along with the most efficient use of existing resources, is a guiding principle 
in choosing an appropriate experiment design. 

The association between a treatment design and an experiment design is illus- 
trated here under two different settings. The first illustration shows how three treat- 
ments are associated with six experimental units when each treatment is assigned to 
two experimental units. The second illustration shows how the three treatments are 
assigned to the six experimental units after the units are blocked into two sets of 
three homogeneous units. 


An Experiment Design Without Blocking 


Consider an experiment to compare three gasoline additives for their effect on 
carbon monoxide emission. Automobile engines are used as experimental units. 
Each of the three gasoline additives is to be used with two engines. A schematic 
representation of the design is shown in Figure 1.1. The boxes represent automobile 
engines as experimental units. One of the three gasoline additives is administered to 
an engine independently of the other engines in the experiment. The three gasoline 
additive treatments are randomly allocated to the six engines, two units for each 
treatment. 


Experimental unit 


(Engine) 


Additive A Additive B Additive C 


ia 


Figure 1.1 Illustration of a completely randomized design, with three treatments, 
each on two experimental units 


on, Kn 


_ This design, known as the completely randomized design, is the simplest of 
the experiment designs. The treatments are allocated to the experimental units at 
random. Each experimental unit has the same chance of receiving any treatment. 


The function of randomization in experiment designs is discussed in Section 1.8. 
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An Experiment with One Blocking Criterion 


The completely randomized design provides little control over environmental 
variation. Several general classes of designs use blocking and grouping of experi- 
mental units to control environmental variation. The simplest blocking design is the 
randomized complete block design with one blocking criterion. This design em- 
ploys one restriction on the random assignment of treatments to experimental units; 
all treatments must occur an equal number of times in each block. 

Consider the arrangement of units in a randomized complete block design for 
an experiment to study the effects of three diets on the growth of laboratory mice. 
Three mice of the same gender were selected from each of two litters. The litters 
were used as blocks for the experiment as shown in Figure 1.2. The three diets were 
randomly assigned to the three mice in each litter. 


Litter 1 Litter 2 
Mouse 1 Mouse 2 Mouse3 Mouse 4 Mouse5 Mouse6 


ae = 


Diet A Diet C 


Figure 1.2 [Illustration of a randomized complete block design, with three experi- 
mental diets tested in two litters of mice 


The use of a litter as a block reduces experimental error variation because it 
isolates the variation among litters from the variation among diets. The diets can be 
compared under uniform conditions when each of them is tested in the same litter. 
A complete discussion of the randomized complete block design and its analysis is 
given in Chapter 8. 


Covariates for Statistical Control of Variation 


Covariates are variables that are related to the response variable of interest. The 
information on covariates is used to enact a statistical control on experimental error 
variance by a procedure known as the analysis of covariance. 

It was suggested earlier that body weight can be used as a criterion for block- 
ing. After animals are grouped according to weight no further use is made of those 
values. But, the actual body weights can be used effectively in the statistical model 
to reduce the estimates of experimental error in lieu of or in addition to using them 
for blocking. Body weight would be a covariate of weight gain in the experiment. 
Other examples of covariates might be pretest scores, field plot fertility, previous 
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year’s yield in perennial crops, or purity of raw material in a chemical process; all 
may vary from unit to unit. Any attribute that is measurable and thought to have a 
statistical relationship to the variable of primary interest is a candidate for covariate 
adjustment. 

‘he prime requirement is that the covariate be unaffected by the particular 
treatments used in the experiment. In practice the covariates are measured before 
the treatments are administered or before the treatments have had time to develop a 
response by the unit, or we can assume the treatment never has an effect on the 
covariate. 

The observations in the experiment consist of pairs of observations (z, y) on 
each experimental unit, where y is the variable of interest in the experiment and z is 
the covariate. Suppose there are six such pairs of observations for each of two 
treatment groups, and the data are plotted as shown in Figure 1.3. 


y O = Treatment 1 
@ = Treatment 2 


(Xa, Yo) 


Figure 1.3 An illustration of covariate adjustment for two treatment groups 


A portion of the variation in y is associated with z as well as with the treatment 
effects if there is a statistical relationship between the variable of interest, y, and co- 
variate x. The analysis of covariance estimates the regression relationship between 
y and the covariate x to statistically reduce the experimental error variance. The 
average responses to each treatment, 7, and Y,, are adjusted to the same value of 
the covariate, usually the grand mean & as shown in Figure 1.3. A comparison 
between the adjusted treatment means, 7,, and Y,,, eliminates the influence of the 
covariate x on the comparison. For example, if weight gain has the covariate of ini- 
tial body weight, the average weight gain response to a treatment is adjusted to re- 
move the variation associated with initial body weight. The adjusted treatment 
means for weight gain represent the weight gains that would be obtained if all ani- 
mals had the same initial body weight. The analysis of covariance is discussed in 
Chapter 17. 
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1.6 Replication for Valid Experiments 


The scientific community regards replication of experiments to be a prime requisite 
for valid experimental results. Replication implies an independent repetition of the 
basic experiment. More specifically, each treatment is applied independently to 
each of two or more experimental units. 

There are several reasons for replicating an experiment, most notably: 


e Replication demonstrates the results to be reproducible, at least under the 
current experimental conditions. 


e Replication provides a degree of insurance against aberrant results in the 
experiment due to unforeseen accidents. 


e Replication provides the means to estimate experimental error variance. 
Even if prior experimentation provided estimates of variance, the estimate 
from the present experiment may be more accurate because it reflects the 
current behavior of observations. 


e Replication provides the capacity to increase the precision for estimates of 
treatment means. Increasing replication, r, decreases Sy = sr, thereby 


increasing the precision of y. 


Observational Units and Experimental Units Can Be Distinctly Different 


The observational unit may not be equivalent to the experimental unit. The obser- 
vational unit can be a sample from the experimental unit, such as individual plant 
samples from a field plot or serum samples from a subject. 

The variance of the observations on the experimental units is the experimental 
error variance. It is a valid measure of the variation among the experimental units 
that have independently had the treatments administered to them. 

The variance among multiple observations from the same experimental unit of- 


_ten is used mistakenly as a measure of experimental error for comparisons among 


treatments. The following examples may help to clarify the distinction between 
replicated experimental units and multiple observations from the same experimental 
unit. 


Example 1.1 A simple animal diet ration study has one cage or pen of six 
animals assigned to ration A and a second cage or pen of six animals assigned 
to ration B. Weight gain or some other appropriate data are collected to test 
the efficacy of the rations. The necessary measurements are made on each of 
the animals in the pens at the end of the study. The schematic in Figure 1.4 
illustrates the design. 


Typically, the Student ¢ test using the difference between the means of the two 
pens would be used to test a hypothesis of no difference between the rations. 


1.6 REPLICATION FOR VALID EXPERIMENTS 17 


Pen 1 Pen2 
Ration A Ration B 


bead (| |] 
2.5m 


bal = Animal with Ration A ba = Animal with Ration B 


Figure 1.4 Illustration of an unreplicated experiment 


However, the differences between the two pens of animals can be caused by the ef- 
fect of other factors in addition to the treatments. The response to a given ration 
may vary with different pens of animals. This variation in response could be due to 
one or more of the factors contributing to experimental error. 

The natural variation in the response from one pen to another, or the pen effect, 
will contribute to experimental error. Also, variation in the preparation and presen- 
tation of the treatment to the separate pens can cause a treatment—pen interaction. 
Therefore, any differences between the two rations in Example 1.1 cannot be attri- 
buted clearly to the rations alone. The differences may be attributable to combina- 
tions of the treatment effects, pen effects, and treatment—pen interactions. 

The experiment will not have resolved clearly the question of whether the two 
rations differed in their effect on weight gain. The experiment has only one true 
replication. The pen is the experimental unit because that is the unit to which the 
treatment was administered independently. The animals in the pen are the observa- 
tional units. The variance calculated among the observations on the animals within 
the pens is only an estimate of observation error within a pen of animals and not an 
estimate of variance among the experimental units. 


Example 1.2 Suppose the experiment in Example 1.1 is restructured such 
that the animals are randomly divided into four pens of three animals each. 
Further, each of the two rations is randomly assigned to two pens of animals 
as shown in Figure 1.5. The rations are administered independently to each of 
the pens. 


Each ration has been replicated properly in Example 1.2. The experimental 
units are the pens to which the rations have been administered independently (two 
pens per ration), and the animals within the pens are still the observational units. 
Therefore, the response from the experimental unit is the average pen response. 

The estimate of experimental error variance, s*, calculated as the variance 
among the pen means within each of the rations, is the appropriate variance for the 
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Ration A Ration B 
Pen 1 Pen 2 Pen 3 Pen 4 
a = Animal with Ration A Ee = Animal with Ration B 


Figure 1.5 Illustration of a replicated experiment 


Student ¢ test. The variance among the animals within each pen, say s2,, is a mea- 
sure of the variability of the observational units within the replicate pens. 
Thus, two levels of variation are present in this type of study: (1) the variation 
among observational units within experimental units (s2,), and (2) variation among 
the experimental units (s?). 
It is important to distinguish which units of the study constitute the experimen- 
tal unit and consequently which units constitute true replication of the treatment. 
More details on the statistical models and analysis for the pseudoreplication 
and true replication exhibited by Examples 1.1 and 1.2 may be found in Addelman 
(1970). Examples of pseudoreplication in agronomic studies may be found in 
Nelson and Rawlings (1983). Hurlbert (1984) provided numerous examples of 
pseudoreplication in ecological field experiments. 


1.4 How Many Replications? 


The number of replications in a research study affects the precision of estimates for 
treatment means and the power of statistical tests to detect differences among the 
means of treatment groups. However, the cost of conducting research studies 
constrains the number of replications for a reasonably sized study. Thus, replication 
numbers are determined on the basis of practical constraints that we can assign to 
the problem. 


Replication Numbers for Testing Hypotheses 


‘he method for determining the number of replications is often based on a test of a 
hypothesis about differences among treatment group means. An elementary method 
for experiments with two independent samples is used here to illustrate a few attri- 
butes of the replication number problem. 
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The method is based on a test of a hypothesis about the difference between two 
treatment group means d = m; — m,, with known experimental error variance s?, 
using the standard normal distribution test statistic. The method determines the - 
number of replications required to test the difference between two independent 
sample means with specified Type I and Type II errors. 

The required number of replications is affected primarily by four factors that 
are required for calculations: 


e the variance (o7) 


e the size of the difference (that has physical significance) between two 
means (6) 


e the significance level of the test (a), or the probability of Type I error 


e the power of test 1 — (, or the probability of detecting 6, where # is the 
probability of a Type II error 


The required replication number for each treatment group, r, for two-sided 
alternatives is estimated with 


r > 2lza/2 + 2) (<) (1.1) 


where 24/2 is the standard normal variate exceeded with probability a/2 and zg is 
exceeded with probability G. Probabilities for the standard normal variable are 
found in Appendix Table I. 

The replication number can be estimated with knowledge of the percent coeffi- 
cient of variation, %CV. The %CV is substituted for o in Equation (1.1), where 
%CV=100(c/p). The difference (6) must be expressed as a percentage of the over- 
all expected mean of the experiment, %é6 = 100(6/2), in Equation (1.1). 

The influence of the coefficient of variation (%CV), the percent difference 
(%6), the power (1 — (@), and the significance level (a) on required numbers of 
replications is shown in Table 1.1. Although the values in Table 1.1 only apply to 
two independent samples, the influences are similar in more complex experiments. 

Required replication numbers generally increase if 


e the variance, %CV or o?, increases 

e the size of the difference between two means, %é6 or 6, decreases 

e the significance level of the test, a, decreases 

e the power of test, 1 — (, increases 

Calculated values for required replication numbers are estimates and approxi- 


mations. Often they are determined on the basis of variance estimates associated 
with previous studies and not the variances from the actual study that will be used 
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Table 1.1 Number of replications required for a given coefficient of variation 


(%CV) and probability (1 — @) of obtaining a significant difference of %6 be- 
tween two treatment means with a two-sided test at the a significance level 


aa205 a= .01 
% %6 
%CV i-f ~ (eee eee lL CL 
5 80 4 I 6 2 
95 7 2 9 3 
10 80 16 4 24 6 
95 26 7 36 9 


to compute confidence intervals and tests of hypotheses. The determination of 
replication numbers for analysis of variance applications will be considered in 
Chapters 2, 5, 6, and 7. Commercial software programs are available to determine 
replication numbers for many different types of experimental studies. 


1.8 Randomization for Valid Inferences 


In reconciling thus the two desiderata of the reduction of error and of the valid 
estimation of error,...no principle is in the smallest degree compromised. An 
experiment either admits of a valid estimate of error, or it does not; whether it does so, 
or not, depends not on the actual arrangement of plots, but only on the way in which 
that arrangement was arrived at. (Fisher, 1926) 


Replication of the experiment provides the data to estimate the experimental error 
variance. Blocking provides a means to reduce experimental error. However, repli- 
cation and blocking alone do not guarantee valid estimates of experimental error 
variance or valid estimates of treatment comparisons. 

Fisher (1926) was making the point that randomization, alone, in the experi- 
ment provides a valid estimate of error variance for justifiable statistical inference 
methods of estimation and tests of hypotheses. Randomization is the random 
assignment of treatments to experimental units. 


A Rationale for Randomization 


Our analysis of data from experiments assumes the observations constitute a ran- 
dom sample from a normally distributed population. This assumption is plausible 
for comparative observational studies that use random samples of the available 
observation units from different treatment populations. However, whether experi- 
mental units can be considered a random sample is questionable when they are 
carefully selected, controlled, and monitored in experiments. 


1.8 RANDOMIZATION FOR VALID INFERENCES 21 


Independent observations are critical for estimation and tests of hypotheses 
because they provide valid estimates of experimental error variance. But, the as- 
sumption of independence among the experimental units cannot be justified when 
relationships exist among them. For example, it is well known that field plots tend 
to respond more similarly when they are adjacent. Any type of proximity can pro- 
duce correlated responses whether it be physical location of units or temporal 
performance of tasks on the units. 

Fisher (1926) recognized these potential difficulties with field plot experiments 
and justified random assignment of treatments to experimental units as the means to 
obtain valid estimates of experimental error variance. In a more detailed discussion, 
Fisher (1935) showed that randomization provided appropriate reference popula- 
tions for statistical inferences free of any assumptions about the distribution of the 
observations. He showed that significance tests could be based on the distribution 
created by randomization and that the normal theory tests provided reasonable 
approximations to these test results. Thus, the random allocation of treatments to 
the experimental units simulates the effect of independence and permits us to 
proceed as if the observations are independent and normally distributed. These 
randomization tests, illustrated in this section, form the basis for valid statistical 
inferences in properly randomized experiments. 

Further justification for randomization (Cochran & Cox 1957; Greenberg 
1951; and Ostle & Mensing 1975) is based on the need to eliminate biases in the 
comparison of treatments that arise through systematic assignment of treatments to 
experimental units. If, for example, procedure A is always performed before proce- 
dure B, any systematic variation over time will bias the resulting comparisons be- 
tween A and B. Randomization over these potential systematic sources of variation 
ensures estimates of treatment means differ from true values only by random 
variation. 


Randomization Tests Show Utility of Randomization 


The utility of randomization can be demonstrated with a randomization test that 
makes no assumptions about the form of the probability distribution for the obser- 
vations. Randomization creates a population of experiments that could have been 
performed, although only one arrangement has been chosen at random for the 
actual experiment. 

The randomization test evaluates the test statistic for all possible arrangements 
of treatments on the experimental units. The randomization distribution is the dis- 
tribution of those values that would be obtained under the null hypothesis of no 
treatment effects. 


Example 1.3 Illustration of a Randomization Test 


Consider an experiment in which two treatments are randomly assigned to 
seven experimental units. Four experimental units receive treatment A and 
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three experimental units receive treatment B, with the following set of 
responses: 


Unit: 1 a 3 4 5 6 7 
Treatment: B B A A B | 
Response: 14 16 19 17 15 13 17 


If there is no difference between the effects of treatments A and B, then 
they are merely labels on the experimental units and do not affect the results. 
For example, if the null hypothesis is true, the response for unit 1 would be 
14 regardless of the treatment applied. The same is true for each of the other 
units under the null hypothesis. 

The A and B labels (four A’s and three B’s) may be allocated to the seven 
experimental units in 7! /4!3! =35 possible arrangements. These are the 35 
experiments possible if the treatments are randomly allocated to the units. 

All 35 of the possible arrangements are shown in Table 1.2, along with the 
difference between the group means (¥, — ¥g) based on the labels assigned 
to the units in each arrangement. The 35 differences (y¥, — Y,) are 35 possi- 
ble differences that could occur for the 35 possible randomizations if the null 
hypothesis is true. They make up the randomization distribution under the 
null hypothesis. | 


Table 1.2 Thirty-five possible arrangements of four A’s and three B’s to seven 
experimental units with the mean difference 7, — Yp 


YA — YB 
— 3.17 
— 2.58 
— 2.58 
— 2.00 
— 2.00 
— 1.42 
— 1.42 
— 1.42 
— 1.42 
— 0.83 
— 0.83 
— 0.83 
— 0.83 
— 0.25 
— 0.25 
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Table 1.2 (Continued) 


Arrange- Unit: 1 Z 2 “ 5 6 7 
ment Response: 14 16 19 17 15 13 17 Ya— Up 
16 A B B A A B A — 0.25 
17 B A A B A A B — 0.25 
18 B A B A B A A — 0.25 
19 A A A B A B B 0.33 
20 A A B A B B A 0.33 
21 B B A A A A B 0.33 
Ze B B A B A A A 0.33 
23 A B A A A B B 0.92 
24 A B A B A B A 0.92 
23 B A A A B A B 0.92 
26 B A A B B A A 0.92 
pa B A B A A B A 0.92 
28 A A A A B B B 1.50 
29 A A A B B B A 1.50 
30 B B A A B A A 1.50 
31 A B A A B B A 2.08 
52 B A A A A B B 2.08 
55 B A A B A B A 2.08 
34 B B A A A B A 2.67 
i B A A A B B A 3.25 


Consider an alternative hypothesis H,: 44 — ug # 0 to the null hypothesis 
Ho: 4A — LB = 0. The arrangement of the actual experiment is arrangement 
30 with a mean difference (¥, — Y¥z) = 1.50. An absolute difference of 1.50 
or larger occurs with 13 arrangements. Under the null hypothesis an absolute 
mean difference of 1.50 or larger occurs with a frequency of =, yielding a 
significance level of 0.37. On the basis of the observed results of the 
experiment, arrangement 30, there is no reason to reject the null hypothesis. 

Under the assumption of a true null hypothesis, Ho: u44 — ug = 0, the 
randomization test enabled an evaluation of the test statistic from the jot 
experiment, (Y¥4 — Yg) = 1.50, against the values for ¥, — Yp from all other 
members of the population of 35 possible experiments. 


Normal Theory Tests Approximate Randomization Tests 


me 


Fisher (1935) first demonstrated that normal theory tests are good approxima- 
tions to the randomization tests provided there has been a random allocation of the 
treatments to the experimental units and sample sizes are reasonably large. 
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Approximations to the randomization test by normal theory tests improve as sample 
size increases. Practical guides to randomization tests for various experimental 
situations may be found in Edgington (1987) and Manly (1991). 

Rigorous treatments of the randomization models and tests of significance in 
experiment designs are provided by Kempthorne (1952), Scheffé (1959), Mead 
(1988); and Hinkelmann and Kempthorne (1994). More detailed discussions of 
randomization in connection with statistical inference may be found in Kempthorne 
(1966, 1975). 

It has become common practice to describe the statistical models for experi- 
mental studies in terms of the normal theory models. The formalities of the normal 
theory models are more straightforward than those for the randomization models, 
however, the use of normal theory models for experiments can be justified only 
under the randomization umbrella. 


Restricted Randomization for Difficult Circumstances 


Randomization may result in an arrangement of treatments that is unsatisfactory 
from the point of view of scientific validity. Sequential arrangements, such as AAA 
BBB CCC or ABC ABC ABC, are possible with random assignment. However, the . 
sequential arrays of treatments can lead to problems of bias. 

Hurlbert (1984) questions the blind use of randomization in small-scale eco- 
logical studies. In small studies, there is a very high possibility of randomized lay- 
outs resulting with treatments markedly segregated from each other in space or 
time. Segregation could lead to spurious treatment effects in which treatment and 
space effects are confounded. Hurlbert (1984) argues that there must be some 
physical interspersion of the treatments to avoid the systematic segregation of treat- 
ments in small experiments. 

Yates (1948) and Youden (1956) a introduced restricted randomi- 
zation as a solution to the problem of bad patterns from complete randomization. 
Restricted randomization omits certain arrangements of treatments that in the 
opinion of the experimenter are unacceptable for the particular study under 
consideration. 

Industrial experiments may require elaborate dismantling of experimental ap- 
paratuses between certain types of treatments. Youden (1972) gave examples of 
industrial and laboratory experiments in which the cost of switching from one treat- 
ment to another might outweigh the advantages of complete randomization. 

Bailey (1986, 1987) discussed some history and recent research on the topics 
of restricted randomization. Schemes for restricted or constrained allocation of 
treatments have been developed that permit the usual analysis of variance 
procedures (Bailey, 1986; Youden, 1972). 
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1.9 Relative Efficiency of Experiment Designs 


Relative efficiency measures the effectiveness of blocking in experiment designs to 
reduce experimental error variance. In practice, relative efficiency is measured to 
determine the efficiency of the design actually used relative to another simpler 
design that could have been used but was not. For example, the efficiency of a ran- 
domized complete block design is determined relative to a completely randomized 
design. 

The variance of a treatment mean of = o*/r is a measure of the precision of 
the estimated treatment means in an semaine The — of an estimate for 
the treatment mean is controlled by the magnitude of o? and the number of replica- 
tions, r, both of which are to some extent under the control of the se emcees The 
investigator may increase the number of replications to decrease a7 and hla 
the precision of the mean estimate. The investigator may also aT to reduce o? 
through various local control activities (such as blocking), thereby increasing the 
precision of the experiment. A method for measuring the effectiveness of blocking 
is discussed in this section. 

The use of os as a measure of precision provides a means for comparing the 
relative precision of two experunent designs. Suppose one design has a true experi- 
mental error variance of o? = 1, and a second design has a true experimental error 
variance of o2 = 2. The value of oz = 0°/r will be the same for the two designs if 
the second design has twice as mae replications as the first design. That is, the 
variance of a treatment mean in each of the designs is 


2 
; oO I 
Design 1: of = — = — 
ow EY 


and 


2 2 
. een. ome y ee 
Design 2: ==> 


The variances oF and o5, will be the same only if rz = 2r,, or Design 2 has 
twice the replication as that of Design 1. Therefore, Design 1 is more efficient than 
Design 2 with respect to the number of replications required to have the same preci- 
sion for an estimate of the treatment mean. 

In practice, o? is unknown for each of the designs and must be estimated from 
the data. Also, the degrees of freedom for the estimate of the variance changes with 
the designs. Under these circumstances the relative precision of the two designs is 
determined on the basis of the concept of information (Fisher, 1960). Fisher calcu- 
lated the amount of information that the estimated difference between two means 
provides about the true difference between the population means. Information cal- 
culated from this concept is 
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ea +3) ) (1.2) 


where s? is the estimated experimental error variance with f degrees of freedom. If 
o” is known, then J = 1/o7and the coefficient (f + 1)/(f +3)is replaced by 
unity. For any reduction in variability, s*, there is a concurrent increase in the 
information one has about the mean difference of the populations. Precision and 
information both increase as the variability decreases. 

The relative efficiency of two experiment designs is defined as the ratio of 
information i in the two designs. Suppose 


(iti 1 de 
a ONES ge 


are the estimated information measures of Design | and Design 2, respectively. The 
relative efficiency of Design 1 to Design 2 is estimated as 


pont wht te (1.3) 


In (fi +30 f2 +1) 87 


If RE = 1, then the information in the two designs is equal, and the designs 
each require me same number of replications to have the same variance of treat- 


“ment means o;. If RE > 1, then Design 1 is more efficient than Design 2. For 


example, if RE = 1.5, then Design 2 requires 1.5 times as many replications as 
Design 1| to have the same variance of a treatment mean. 


1.10 From Principles to Practice: A Case Study 


The design of an experimental medical study reported by Moon et al. (1995) illus- 
trates the process of putting principles into practice. The components of the study 
design provide examples of how the principles of research design, covered in this 
chapter, helped prepare the investigators to address their research hypothesis. The 
following is a brief description of the main research design elements in that study. 
Other details on the study can be found in the publication. 


The Problem 


Non-melanoma skin cancer, which includes basal cell (BCC) and squamous cell 
(SCC) carcinomas, is the most common type of cancer. Residents of Arizona 
experience a three to seven times greater incidence of these cancers than the general 
U.S. population. Although non-melanoma cancers are usually not life threatening, 
they result in substantial morbidity and treatment expense. 

A history of actinic keratoses (AK), a type of skin lesion, has generally been 
accepted as a marker for identifying individuals at increased risk of skin cancer. In 
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many cases AK lesions progress to non-melanoma skin cancer and are thus prema- 
lignant lesions. Those individuals who have a history of actinic keratoses, but few if 
any non-melanoma skin cancers, are considered to be at moderate risk. Also, fair- 
skinned, older people with a long history of sun exposure are more at risk for non- 
melanoma skin cancer. 

Results from recent clinical and laboratory studies suggested that vitamin A 
and other retinoids have a preventative effect against cancer in epithelial tissues 
such as skin. However, these studies included a small number of subjects and did 
not produce reliable estimates of the vitamin A effect in the primary prevention of 
human skin cancer. 2 

A five-year clinical trial was to be conducted in southern Arizona to evaluate 
the effectiveness of vitamin A or retinol supplementation to reduce the risk of non- 
melanoma skin cancer for individuals already at moderate risk. 


Research Hypothesis 


Retinol (vitamin A) supplementation reduces the incidence of skin cancer in mod- 
erate-risk individuals with a history of at least ten actinic keratoses. 


Treatment Design 


Important considerations for the dosage level of retinol (vitamin A) included the 
need to elevate the daily intake of retinol above the common intake of most adults 
and to avoid a dosage that could potentially induce adverse side effects known to be 
associated with excessive retinol intake. A placebo treatment was necessary as a 
comparison group for the treated group; these were subjects with the same charac- 
teristics as the treated subjects and on the same follow-up protocols during the 
course of the study. The subjects could not be told to which treatment group they 
were assigned in order to keep the subjects from both groups on the same regimen. 


Treatment: Daily, self-administered, dietary supplement of 25,000 IU retinol 
in capsule form. 


Placebo: Daily, self-administered, placebo capsule 


Measurements of Interest 


The hypothesis addressed the risk of skin cancer in its relationship to levels of reti- 
nol. Therefore, the measurement of interest was whether a subject developed skin 
cancer during the course of the study. The analysis could consider several ap- 
proaches to test the hypothesis. The approaches included whether a cancer 
developed, a binary outcome; how many cancers developed, a count variable; or 
how long it took for a cancer to develop, which is a time-to-event measurement 
used in survival analysis. Each of these approaches could be extracted by recording 
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the time it took to develop a cancer, if one developed. Thus, the measured variables 
were the 


e time to the first and each subsequent, if any, basal cell carcinoma (BCC) 


e time to the first and each subsequent, if any, squamous cell carcinoma 
(SCC) 


Selection of Subjects with Common Characteristics 


Subjects for this study were required to be representatives of a healthy adult popu- 
lation with a moderate risk to non-melanoma skin cancer, willing to participate in 
the study, and not currently ingesting an excessive amount of vitamin A in their 
diet. More than 11,000 subjects were screened for the study; approximately 25% 
were deemed eligible. Following are some of the criteria used by the investigators 
to screen the subjects. 

The subjects were recruited through referrals by dermatologists and media 
announcements. The subjects could be men or women with a history of at least ten 
actinic keratoses diagnosed clinically, with the most recent diagnosed in the past 
year. They could have had no more than two prior occurrences of SCC or BCC and | 
no cancer diagnosis other than SCC or BCC in the previous year. 

The eligible individuals had to be between 21 and 84 years of age, be ambula- 
tory, be capable of self-care, have no diagnosis of a life-threatening disease, intend 
to be a resident of Arizona for the next five years, and be willing to commit to 
semiannual follow-up clinic visits for that period. They also had to be willing to 
limit non-study vitamin A supplementation to no more than 10,000 IU per day. 
Further, they had to be within the 95% normal range for total cholesterol, liver 
function, white blood cell count, hemoglobin, and platelet count. 


Some Techniques to Reduce Nonrandom Error and Bias 


Several precautions had to be taken to ensure that nonrandom error and bias in re- 
sponse was minimized in the study. This included assurance that the subjects would 
take the medication regularly, retain their willingness to remain in the study, return 
for their follow-up clinical visits for evaluation, and remain unaware of the treat- 
ment group to which they were assigned. 

Conceivably, if subjects knew they were in the placebo group they could self- 
administer extra vitamin A on their own accord with the hope of reducing their per- 
sonal risk and unknowingly bias the comparison of treatment to placebo. The 
investigators provided an insurance against this sort of event by not revealing to the 
subject « or + dite clinician dispersing the capsules whether the subject was receiving 
the treatment or placebo. This type of trial is known as a double-blind trial. 

A three-month run-in period was established to evaluate the subjects’ ability 
and willingness to adhere to the study protocol. They received a bottle containing 
100 placebo capsules, of which they were to take one per day. Subjects who had 
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taken at least 75% of the capsules during the run-in period and were willing to 
continue the study were assigned to a treatment or placebo regimen, given a six- 
month supply of the appropriate capsules, and scheduled for a follow-up visit. 

Subjects returned to the clinic every six months to receive an examination for 
any symptoms of BCC or SCC. As a safety measure, they were also examined for 
any potential side effects from elevated retinol intake. The subject’s remaining 
medication was weighed to evaluate adherence to dosage. During the interview, 
subjects’ questions were answered and they were motivated to adhere to the medi- 
cation schedules. They were then given the next six months’ supply of capsules and 
scheduled for the next follow-up visit. Subjects were reminded of their upcoming 
follow-up visit by postcard and telephone contact. 

Once a year a blood specimen was collected from a random sample of subjects 
for analyses of retinyl palmitate levels to obtain supplementary information on 
group adherence to the retinol supplement. The levels of retinyl palmitate should 
have been greater for the treatment group than the placebo group if the treatment 
group had adhered to the capsule regimen. 


Replication 


The trial was conducted from two separate clinics, one each in Tucson and 
Phoenix, Arizona. The number of subjects required for the study was based on as- 
sumptions about the average annual incidence of skin cancers in the placebo and 
retinol treatment groups and incidences of anticipated deviation from the pre- 
scribed study protocols by the subjects. The required sample size was determined to 
be 1118 subjects in each of the treatment groups and was based on a power of .80 
and a .05 two-sided Type I error rate, using techniques specific to studies on time- 
to-event measurements. 


Blocking to Reduce Experimental Error 


A subject’s risk to non-melanoma skin cancer was anticipated to be related to the 
amount of time spent in the sun and whether the person had fair skin. Fair-skinned 
people are anticipated to be at greater risk for skin cancer and will sunburn more 
readily in a 30-minute period in southern Arizona. Anyone who spends a greater 
amount of time in the sun is anticipated to be at greater risk for skin cancer. 
Information on weekly sun exposure and anticipated skin-burning reaction after 30 
minutes of sun exposure was collected from each subject during the first contact. 

These were the most likely factors to interfere with risk comparisons between 
retinol treatment and placebo, so sia were used as blocking factors prior to assign- 
ment of treatment to subjects. Subject categorized according to two levels of | 
sun exposure: < 10 hours ms Ss 10 hours per week. They were also grouped 
according to levels of skin reaction after 30 minutes: always or usually burns versus 
burns moderately, rarely, or never. 

Thus, subjects were placed into one of the four block types constructed when 
the two factors were placed in all four combinations of their levels. For example, 
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subjects exposed to the sun < 10 hours per week and whose skin moderately, 
rarely, or never burned after 30 minutes exposure to the sun would be placed in the 
same block. Equal numbers of subjects would be assigned to the retinol treatment 
and placebo in each of the blocks; thus, any potential differences between retinol 
treatment and placebo would not be interfered with by differential subject risks 
based on sun exposure and skin sensitivity to the sun. 


Randomization 


Subjects enter into clinical trials over a period of time as they are identified by their 
physicians as candidates for the study and as they respond to calls for subject vol- 
unteers. Therefore, the assignment of subjects to treatments sometimes takes place 
over a period of one or more years until the required number of subjects (replica- 
tions) has been achieved for the study. For this study, the enrollment period 
required more than four years to acquire a sufficient number of subjects. 

The subjects were assigned to their respective blocking type (described above) 
as they entered the study. They were randomly assigned to the placebo or retinol 
treatment by groups of four subjects with the same blocking criteria. For example, 
if the first two subjects entering in a block were assigned at random to the placebo 
the next two would automatically receive the retinol treatment. Then the randomi- 
zation would start over with the next four subjects entering into any one of the 
blocking groups. This method of assignment in blocks of four subjects ensured an 
equal distribution of subjects on placebo and retinol treatment over the entry time- 
line of the trial, effectively removing the chance of having a greater number of 
subjects on one treatment for a longer period of time than on another treatment. 

The randomizations were done separately for the clinics in Tucson and 
Phoenix. Thus, the clinics became a de facto blocking factor in the study. 


Measured Covariates for Statistical Control of Experimental Error 


Numerous other factors could conceivably have some relationship to risk for skin 
cancer. Those factors thought to have the greatest potential for affecting the com- 
parison of risk between retinol treatment and placebo were used as blocking fac- 
tors: sun exposure and skin sensitivity to sun exposure. To have included more 
factors in the blocking scheme might have made the study unnecessarily cumber- 
some, particularly if no hard evidence suggested these factors had major impacts on 
skin cancer risk. 

Other factors the investigators considered to be potentially influential and for 
which measurements were recorded included age, gender, prior skin cancers (0, 1, 
or 2), number of moles and freckles, vitamin use, dietary vitamin A at the start of 
the study determined from a diet interview, and serum retinyl palmitate at the start 
of the study. 

These factors could have been used as covariates in the data analysis to reduce 
experimental error for comparisons of retinol treatment to placebo. A _post- 
randomization check after the four-year enrollment period was made to determine 


1.10 FROM PRINCIPLES TO PRACTICE: A CASE STUDY 31 


Table 1.3. Cross tabulations of subjects by treatment groups and covariates in the 
skin cancer prevention trial 


Placebo Retinol 
Characteristic n.= 1140 ae L157 
Age 
=< 63 558 584 
> 63 582 573 
Gender 
Female 345 334 
Male 795 823 
Prior skin cancers 
0 932 920 
l b§2 177 
2 45 45 
= ei 15 
Moles and freckles 
0-7 550 541 
ae | 301 326 
Unknown 289 290 
Weekly sun exposure 
0—10 hours 452 493 
> 10 hours 688 664 
Skin burns 
Always/usually 490 ee es 
Moderately/rarely/never 650 640 
Clinical center | 3 
Phoenix 429 420 
Tucson 711 737 
Vitamin use 
No 309 312 
Sometimes 322 331 
Yes 509 514 
Dietary vitamin A (IU) 
1,194-6,979 337 355 
6,980—10,627 324 : 369 
10,628-41,404 365 326 
Unknown 114 107 
Serum retinyl palmitate (mg/ml) 
0.0-6.0 397 400 
6.1—20.0 345 346 
> 20.0 350 378 


Unknown 48 33 
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whether the subjects in placebo and retinol treatment groups were equally distrib- 
uted with respect to each of these factors. The cross tabulations shown in Table 1.3 
indicate a relatively equal distribution of subjects on placebo and retinol treatment 
for each of the major covariates considered in the study. Notice 26 subjects with 
more than two previous skin cancers were enrolled in the study even though they 
did not meet the selection criterion of no more than two previous skin cancers. Of 
course, these could be either recording errors or an oversight by the clinicians who 
could have mistakenly allowed these subjects to enroll. The reason for these types 
of mistakes often remains unknown. 


EXERCISES FOR CHAPTER 1 


1. a. Find the definitions of research and hypothesis in a dictionary, and formulate a definition of 
the term research hypothesis. 
b. How does a research hypothesis differ from the statistical hypothesis formulated for statistical 
tests, such as Hp. wp = O versus H,: p #0? 


2. Choose a journal article in your field of study that reports the results of a comparative experiment 
or observational study. Identify and briefly (one or two sentences) describe the following: 

Research hypothesis 

Treatments 

Experimental (observational) units 

Type of experiment design 

Criteria for any grouping, blocking, or matching done in the study 

Provide the article citation 


moao sp 


3. a. Choose a practical situation from your own field of study and describe a problem whose 
solution must be determined experimentally. 
b. Indicate the following for the problem described in part (a): 
(i) aresearch hypothesis 
(ii) the treatments necessary to evaluate the hypothesis 
(iii) what constitutes an experimental unit 


4. Choose a journal article from your special field of interest and review it for the purpose of 
evaluating the application of good research design. Many aspects of research design have been dis- 
cussed separately in this chapter relative to their effect on statistical and scientific inference. The 
case study in Section 1.10 illustrates the elements of a reported research study that one must 
identify for a validation of the reported findings from a research project. 
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Choose an article that reports on either an experiment conducted to compare two or more treat- 
ments or a comparative observational study conducted to compare two or more “treatment” condi- 
tions that existed a priori. 

Some good questions to ask yourself for the critique are “Did they include all important ele- 
ments of good research design in the study?” “If so, did they implement them properly?” “Are the 
elements of the work described in such a way that I could understand or duplicate what they did?” 


Pay particular attention to the following items in your review: 


Review of the literature 

Statement of the problem 

Research hypothesis and study objectives: discuss whether they were present and reasonable 
Treatment design: describe how it did or did not address the hypothesis and objectives 
Experiment design or observational study design 

Use of randomization (experiments) or random sampling (observational study) and replication 
Statistical hypotheses and statistical analysis procedures 

Conclusions and statistical reliability of conclusions 

Self-evaluation of the study by the author(s) and potential for future investigations 

Provide the article citation 


cor ge rho ao o Pp 


Your critique should describe and evaluate the author’s approach in the research itself and in the 
article relative to each of the important elements of the research based on the preceding list. Include 
a page reference from the article for your comments on each of the items. 


A study is planned on the physiology of exercises with human subject volunteers. The two treat- 
ments in the study are two methods of aerobic exercise training (call the methods A and B). At the 
end of a ten-week exercise period, each subject will undergo a treadmill test for standard respi- 
ratory and cardiovascular measurements. 


Individual Sex Age Individual Sex Age 
| M 54 10 M 18 
2 M 38 11 M 31 
3 F 4] 12 F 18 
4 F 18 13 M 58 
5 F 19 14 M 74 
6 F 39 15 F 58 
7 M 51 16 F 21 
8 F 44 iB: M 35 
9 M 18 M . 34 

19 F 38 
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Nineteen volunteers are listed in the table by sex and age. All volunteers are in good health and 
in the normal weight range for their age, sex, and height. Eight individuals will be tested in each of 
the methods (A or B), so that only 16 of the 19 volunteers will be used; a subject will participate 
only in one of the methods. 

a. Explain how you would group the individuals prior to the assignment of treatments so that 
experimental error variance could be kept at a minimum. 

b. Explain why you grouped as you did. 

c. Show your final assignment of individuals to the treatment groups. 


6. An experiment is planned to compare three treatments applied to shirts in a test of durable press 
fabric treatments to produce wrinkle-free fabrics. In the past formaldehyde had been used to pro- 
duce wrinkle-free fabric, but it was considered an undesirable chemical treatment. This study is to 
consider three alternative chemicals: (a) PCA (1-2-3 propane tricarbolic acid), (b) BTCA (butane 
tetracarboxilic acid), and (c) CA (citric acid). 

Four shirts will be used for each of the treatments. First, the treatments are applied to the 
shirts, which are then subjected to simulated wear and washing in a simulation machine. The chem- 
ical treatments will not contaminate one another if they are all placed in the same washing machine 
during the test. The machine can hold one to four shirts in a single simulation run. At the end of the 
simulation run each of the shirts is measured for tear and breaking strength of the fabric and how 
wrinkle-free they are after being subjected to the simulated wear and washing. The comparisons 
among the treatments can be affected by (a) the natural variation from shirt to shirt; (b) measure- 
ment errors; (c) variation in the application of the durable press treatment; and (d) variation in the 
run of the simulation of wear and washing by the simulation machine. Following is a brief descrip- 
tion of three proposed methods of conducting this simple experiment. 


Method I. The shirts are divided randomly into three groups of four shirts. Each group receives a 
durable press treatment as one batch and then each batch is processed in one run of the simulation 
machine. Each run of the simulation machine has four shirts that have received the same treatment. 
There are three runs of the simulation machine. 


Method II. The shirts are divided randomly into three treatment groups of four shirts each, and the 
durable press treatments are applied independently to single shirts. The shirts are grouped into four 
sets of three, one shirt from each durable press treatment in each of the four sets, and each set of 
three so constructed is used in one run of the simulation machine. There are four runs of the 
simulation machine. 


Method III. The shirts are divided randomly into three groups of four shirts. The durable press 
treatments are applied independently to single shirts. The simulation of wear and washing is done 
as in Method I. 

a. Which method do you favor? 

b. Why do you favor the method you have chosen? 

c. Briefly, what are the disadvantages of the other two methods? 


10. 


11. 


12. 
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Explain what is meant by the term replication in the context of (a) an experiment in which the ef- 
fectiveness of several antibiotics is tested on animals in a laboratory and (b) an observational study 
to determine the differences in grass species present in pure stands of mesquite and pure stands of 
oak in southern Arizona. 


An experiment is planned to compare three methods of instruction. Each is tested with a single 
classroom of 25 students. A different instructor is to be used for each classroom and consequently 
each instruction method. 

a. Write a short critique of the proposed experiment. 

b. How could the experiment be improved? 


An experiment is planned to compare the strengths of three different asphalt mixtures for road 
surfaces. A single batch is to be manufactured for each experimental mixture. Several asphalt 
specimens will be made from each of the mixtures and tested for tensile strength. 

a. Write a short critique of the proposed experiment. 

b. How could the experiment be improved? 


Suppose you want to randomize the allocation of two treatments to 16 experimental units. How 
many randomizations are possible if 8 units are to be assigned to each of the treatments? How 
many randomizations are possible if you want to assign 6 units to one treatment and 10 units to a 
second treatment? 


An experiment with four treatments and five replications of each treatment will require 20 experi- 
mental units. How many randomizations are possible for this experiment? 


An experiment with two treatments and three replications per treatment had the following ran- 
domization of treatments to the experimental units shown along with the measured response on 
each unit: 


Sos | 2 3 4 
Treatment: A B B A A B 
Response: 7 10 9 2 10 12 


Conduct a randomization test of the null hypothesis, Ho: no difference in the treatment effects of A 
and B, versus the alternative, H, : the effect of treatment B is greater than the effect of treatment 
A. Use the test statistic J7p—Y,. (Hint: It is only necessary to identify one-half of the 
randomizations directly. Each randomization has a “mirror” randomization in which the letters A 
and B are interchanged. For example, the “mirror” for the actual experiment randomization is the 
randomization B, A, A, B, B, A.) 
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13. In the table, the coefficients of variation and relative efficiencies (randomized complete block to 
completely randomized) of the same experiment conducted at four locations are given. Each trial 
used a randomized complete block design. 


Location Coefficient of Variation (%) Relative Efficiency (%) 
Tucson 10 100 
Phoenix 10 150 
Los Angeles 20 200 
San Francisco 20 125 


a. How many more replications of a completely randomized design would be necessary at Los 
Angeles to obtain the same precision as the randomized complete block design for estimating 
the treatment means? Explain your answer. 

b. If you were asked the question in part (a) relative to San Francisco, would you require more or 
fewer replications in San Francisco than for Los Angeles? Explain your answer. 

c. Suppose four replications are required with the randomized complete block design at Tucson 
to detect differences of 6 = 20% with a test at the .05 level of significance and a probability 
(power) of .90. How many replications would be required in Phoenix with the same criteria for 
a randomized complete block design? Explain your answer. 

d. Would you require more or fewer replications in Los Angeles than in Tucson with the same 
criteria for a randomized complete block design? Explain your answer. 


2 Getting Started with 
Completely Randomized Designs 


Chapter 1 presented the principles of design in relation to the stated goals of re- 
search hypotheses—accuracy and precision of the observations and validity of the 
resulting analysis. Some of those principles are illustrated in this chapter, using an 
experiment with a completely randomized design. A statistical model is developed 
with parameters to describe the experiment according to the research hypothesis. 
The parameters are then estimated, using the least squares method. Experimental 
error variance is estimated and used to estimate standard errors and confidence 
intervals for the parameters and to test statistical hypotheses about them. The 
fundamental partition for the sum of squares of the observations is derived and 
summarized in the traditional analysis of variance table. 


2.1 Assembling the Research Design 


The research hypothesis, treatment design, and experiment or observational study 
design constitute the research design for the study. Treatments are developed to 
address specific research questions and hypotheses that arise in the research pro- 
gram. For example, a microbiologist hypothesizes that the activity of soil microbes 
depends upon soil moisture conditions. Treatments with different amounts of soil 
moisture are set up to measure the microbe activity at different levels of soil mois- 
ture to evaluate the hypothesis. A traffic engineer hypothesizes that traffic speed is 
related to the width of street lanes. Lanes of different width are selected by the 
engineer, and traffic speed is measured at each lane width to evaluate the 
hypothesis. 

The treatment design has to be integrated into an experiment design. The 
investigator must decide what constitutes an experimental unit, how many 


a7 
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replications of experimental units are required for each treatment, and which 


treatment to assign to each of the experimental units. The investigator must also 
determine whether the experimental units should be blocked into homogeneous 
groups to control experimental error. 

Similarly, the comparative observational study on association of traffic speed 
with lane width requires the investigator to determine how many independent street 
units of each width are required for the study and how to match the street units for 
controlling variables. 

The computational details of the statistical analysis of various designs may 
vary from one design to another. Many of the statistical procedures used for analy- 
sis are common to most of the designs that we encounter. This is so because the 
procedures themselves generally relate to specific treatment designs, each of which 
may appear in any number of experiment design configurations. | 

The intent of this chapter and Chapter 3 is to introduce useful statistical proce- 
dures for a variety of comparative studies. The procedures are extended to more 
complex treatment designs in the ensuing chapters, and their application in other 
experiment designs is demonstrated as the designs are presented. The procedures 
are illustrated first in this chapter with a one-way classification of treatments in a 
completely randomized design with equal replication of treatments. 


Example 2.1 Suppression of Bacterial Growth on Stored Meats 


The shelf life of stored meats is the time a prepackaged cut remains salable, 
safe, and nutritious. Standard packaging in ambient air atmosphere has a shelf 
life of about 48 hours after which the meat quality begins to deteriorate from 
microbial contamination, color degradation, and shrinkage. Vacuum pack- 
aging is effective in suppressing microbial growth; however, other quality 
losses remain a problem. 


Recent studies suggested controlled gas atmospheres as possible alternatives 
to existing packagings. Two atmospheres which promise to combine the ca- 
pability for suppressing microbial development while maintaining other meat 
qualities were (1) pure carbon dioxide (CO), and (2) mixtures of carbon 
monoxide (CO), oxygen (O2), and nitrogen (N). 


Research Hypothesis: Based on this new information the investigator hy- 
cahesiaad that some form of controlled gas atmosphere would provide a 
more effective packaging environment for meat storage. 


reatment Design: The treatment design developed by the investigator to ad- 
diese the hypothesis included packagings with (1) ambient air in a commer- 
cial plastic wrap, @) a vacuum, (3) a mixture of gases consisting of 1% CO, 
40% O>, and 59% N, and (4) 100% CO. The ambient air and vacuum pack- 
agings served the = of control treatments because both were standards to 
which new packagings could be compared for effectiveness. 


Experiment sail A completely randomized design was used for the ex- 
periment. ‘ee beef steaks of approximately the same size (75g) were 
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randomly assigned to each of the packaging conditions. The randomization 
method is demonstrated in Section 2.2. Each steak was packaged separately 
in its assigned conditions. (For the purpose of this illustration the packaging 
_.__ treatments are evaluated for their effectiveness in suppressing bacterial 
growth.) The number of psychrotrophic bacteria on the meat was measured 
after nine days of storage at 4°C in a standard meat storage facility. Psychro- 
trophic bacteria are found on the surface of the meat and are associated with 
spoilage of the meat product. 

The results are shown in Table 2.1. Bacterial growth is expressed as the 
logarithm of the number of bacteria per square centimeter. 
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Table 2.1 Psychrotrophic bacteria [log (count/cm?)], on meat samples stored in 
four packaging conditions for nine days 


Psychrotrophic Bacteria 


Packaging Condition Log (count/cm?) Total Mean 
Commercial plastic wrap 7.66, 6.98, 7.80 22.44 7.48 
Vacuum packaged 5.26, 5.44, 5.80 16.50 5.50 
1% CO, 40% Oz», 59% N 7.41, 7.33, 7.04 21.78 7.26 
100% CO, 2.514291;:3.66 10.08 340 


Source: B. Nichols (1980), Comparison of Grain-Fed and Grass-Fed Beef for Quality Changes 
When Packaged in Various Gas Atmospheres and Vacuum, M.S. thesis, Department of Animal 
Science, University of Arizona. 
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Randomizing Treatments in Experiment Designs 


The steaks used for the experiment were relatively homogeneous experimental © 
‘units, and a completely randomized design was used to avoid any subjective assign- 
meet of treatments to the steaks. The proper randomization procedure for a com- 


pletely randomized design is illustrated with the meat storage study. 


Step 1. Assign the sequence of numbers | through 12 to the experimental 
units, he steaks, 


Step 2. Obtain a random permutation of the numbers 1 through 12 and write 
them down in the permutation order. A random permutation can be obtained by 
taking a sequence of two- or three-digit numbers from a random numbers table 
(Appendix Table XII) and ranking them from the smallest (1) to the largest (NV). 
The rank numbers in the sequence constitute a random permutation. Let each num- 
ber in the permutation order equal the sequence number of a steak. Suppose the 
permutation order is 
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Step 3. Assign the first three steaks in the list (1, 6, and 7) to treatment A. As- 
sign ‘the next three steaks (12, 5 and 3) to treatment B and so forth. The final 
assignment of steaks to treatments is 


Stenk: G4. 7 ol - Soe 3 od D9 oe Gd 
Treatment: «A cA. A. So, B.. Cac = BD dD 


navubonielate » Wii bbsthieretal Statistical Shputing programs include rou- 
tines for permutations and randomization. 
An equivalent method of randomization can be used in the absence of a com- 
_ puter program or a random numbers table. After assigning numbers to the experi- 
mental units as in Step 1, construct corresponding numbered slips of paper or tags 
and draw them from a hat at random. The first r numbers drawn are the experimen- 
tal units assigned to the first treatment. The second r numbers drawn are the ex- 
perimental units assigned to the second treatment, and so forth. A discussion of 
proper and improper methods of treatment assignment to units in a random fashion 
is discussed by Hader (1973). 


Selecting the Units for Comparative Observational Studies 


The traffic engineering study on street lane widths described at the beginning of the 
chapter is a comparative observational study. The investigator is unable to random- 
ly assign a unit to a treatment group in the comparative observational study. De- 
pending on the type of research program, the basic units of the investigation are 
either self-selected into the treatment groups or they exist in their characteristic 
groups. | 

A probability sample of units should be selected from available members of 
each treatment population. Units are selected from within each population such that 
each unit has an equal chance of entering the sample. Note that each population 
represents a separate treatment classification, and random sampling is maintained 
only within the population. 

The first step requires an identification of the populations that represent the 
conditions or treatments of interest in the observational study. A list is constructed 
of all available units in each population. In an ecological study, for example, the 
two treatments may be grassland and oak woodland plant communities. The popu- 
lations consist of all grassland sites and all oak woodland sites in a particular study 
area. Each of the sites in both populations is assigned a unique identification code. 
For example, assign the numbers 0 through 99 to the 100 sites available in each 
classification. 

Suppose the investigator wants to establish plots in ten grassland sites and ten 
oak woodland sites. The method for sampling the ten sites in each of the study 
areas begins with the selection of ten 2-digit numbers from a random numbers table 
(Appendix Table XII) for each of the sites. Suppose the first set of 2-digit numbers 
chosen is 12, 63, 34, 05, 97, 72, 42, 44, 82, 51. The sites so numbered in the study 
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area are used to establish measurement plots. The same procedure is followed for 
each treatment group. If the same random numbers are drawn more than once from 
the table, additional 2-digit numbers may be drawn at random from the table to 
complete the sample. Many commercial statistical computing programs include 
routines for random sampling from a list. 


2.3. Preparation of Data Files for the Analysis 


A fairly large amount of data are collected in most studies. The data first must be 
organized for use by computer programs. The execution of statistical programs on 
computers requires a data file. The file can be either typed into the terminal when 
the program is being run or it can exist in a file that previously has been put on a 
computer storage device with a separate data entry program. 

Each observation is identified clearly in the data file with a particular experi- 
mental unit and treatment classification in the study. The data file then has a conve- 
nient format for analysis and scrutiny of the data for any irregularities that may 
have occurred in either the procurement or recording of the observations. The data 
file for Example 2.1 might contain a sequence number identifying each beef steak, 
the treatment group to which the steak was assigned, and the observed bacterial 
count as shown in Display 2.1. 


Display 2.1 Data for Meat Storage Experiment 


Steak Treatment Log (count/cm?) 
Commercial 7.66 
Commercial 6.98 
Commercial 7.80 
Vacuum 5.26 
Vacuum 5.44 
Vacuum 5.80 
Mixed Gas 7.41 
Mixed Gas 7.33 
Mixed Gas 7.04 
CO» 351 
CO, 2.91 
CO» 3.66 


The entire data file consists of 12 lines (the titles are not part of the data file), 
each of which contains requisite information the computer program can utilize in 
record-keeping and performing calculations. 

Each line of the file is a case or an observation in programs. The meat storage 
data file has 12-cases or observations. Each column is a variable in the program. 


42 CHAPTER2 GETTING STARTED WITH COMPLETELY RANDOMIZED DESIGNS 


The variables in this data file are steak, treatment, and log (count/cm?). The actual 
values listed in each line of the file are often referred to as the data values. 

The only variables ordinarily required by a statistical program to perform 
calculations for this example are the treatment variable and the measured variable 
log (count/cm’). 

The steak variable is primarily a housekeeping variable used to identify the 
case in the data file with an experimental unit in the actual study. Housekeeping 
variables in a data file may include code numbers for the experimental unit, treat- 
ment names, dates, and experiment numbers. They are useful for management of 
large data files and for memory recall if the data file must be used long after its 
creation. 


2.4 A Statistical Model for the Experiment 


A Statistical analysis is based on an underlying formal statistical model. aa 
interpretation of the analysis requires an eed of the model. In compara- 
tive studies the characteristic of the units or subjects measured upon ob: servation i is 


the response variable, identified by me dakiaiile y. The bacterial count is the re- 
sponse variable in the meat storage experiment. 

The statistical model for comparative studies assumes there is a reference pop- 
ulation of subjects or experimental units. In most cases the population is conceptual 
A ie it is possible to imagine a eee of alata San retail stores, 


A ee population is assumed for each treatment condition in the study, 
and the units in an experiment are assumed to be randomly selected representatives 
of the reference population as a result of randomization. For observational studies, 
we assume the observational units are randomly selected from the treatment 
populations. 

The statistical model is illustrated in Figure 2.1 with four hypothetical treat- 
ment populations. Each population h nas a normal distribution for the response varia- 
ble, and each has a edition mean value. Such a situation would exist if the four 
methods of packaging meat had differing capacities for the inhibition of bacterial 
growth. 

The population variance co” is assumed to be the same for each of the | popula- 
tions and unaffected by the treatments as shown in Figure 2.1. . That is, ‘the vari- 
ances of the treatment populations are assumed to be homogeneous. 


Use of the Cell Means Model to Describe Observations 


ions are expressed as a sum of the treatment population means and the 


ps Ry errors with the geLLaweans model 
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M4 b2 pe4 M3 


Figure 2.1 Illustration of the treatment populations 


Lene ee eee ee ae 


where y;; denotes the jth observation from the ith treatment group; Hi is the mean 
of Se ith treatment population; and e;,,; is the experimental error. This is a linear 
atis model for the one-way treatment classification in a completely random- 

wy experiment design. 

The model makes an allowance for the variation among the observations from 
a given treatment group. Because of experimental error each observation deviates 
from the population mean py; by some amount e;;. The experimental error variance 
ois the variance of the e;;, and it is assumed to be the same for all treatment 
populations. 

The observations for the meat storage experiment are shown in Table 2.2, with 
t = 4 treatments and r = 3 replications with their identification y;; and statistical 
model representation. 


Use of Alternative Models to Describe Alternative Statistical Hypotheses 


The statistical model for the experiment reflects our beliefs about the relationship 
between the treatments and the observations. The cell means model is a full model, 
Yi; = Hi + e;;, that includes a separate mean for each of the treatment populations. 
A model with a reduced set of parameters is used if there are no differences among 
the treatment population means. The reduced model, yi; = 4 + e;;, states that the 
observations all belong to the same population with a mean p. 

The two models represent the alternative statistical hypotheses appropriate for 
the experiment. ‘he reduced model represents the null hypothesis ween,» with no 
lifferences among the treatment means, stated as Ao: py = fa = ++: = pt. The 
model represents the alternate hypothesis condition when there are some differ- 
ences among the treatment means, stated as H,: ; A uw, for somez # k. 

The investigator for the meat storage experiment must determine whether the 
bacterial growth differs among various packaging methods or whether one 
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Table 2.2 Identification of observed values from the meat storage experiment and 
their representation with the linear statistical model 


Steak Treatment Observation Log (count/cm*) His Model 
1 1 l 7.66 Yi1 fy + €11 
6 ] 2 6.98 Y12 [1 + e192 
7 l 3 7.80 Y13 [41 + €13 

tz Z l 5.26 Y21 2 + €2] 
5 Z z 5.44 Y22 2 + €22 
3 2 3 5.80 Y23 [2 + €93 
10 3 l 7.41 Y31 3 Sa €31 
9 2 2 to Y32 p13 + €392 
Z 3 3 7.04 Y33 fz + €33 
8 4 l 251 Y41 [4 + €41 
4 4 2 2.91 ie. pete 
11 4 saa 3.66 Y43 [4 + €43 


packaging method is no better than any other in suppressing bacterial growth. From 
the point of view of the statistical model the investigator must determine which of 
the two, the full model or the reduced model, best characterizes the data in the ex- 
periment. The research questions are translated into questions about statistical 
populations modeled for the experiment in Figure 2.2. 


One population OR several populations? 


(EEE LOREEN fre Sees (ee Se ee ee ee ee Cae ee ene 


p b4 2 bey 3 
Yij= + ej OR Yij = pj + @;;? 
Research question: Is there more bacterial growth with some packagings than with others? 


Statistical question: Which model best describes the experimental results? 


Figure 2.2 The research questions, statistical questions, and alternative models 
for the meat storage experiment 


To make some decision about the treatments the investigator requires a statisti- 
cal method to estimate the parameters of the two models : and, on the basis of an ob- 
jective criterion, determine which of the statistical hypotheses or models best fits 
the data from the experiment. 
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The General Linear Statistical Model 


he cell means model is a special case of the general linear model. The more gen- 
eral model describes relationships between two types of variables as a function 
linear in a set of parameters. One type of variable is the response variable y consid- 
ered to be dependent on the second type of variable consisting of the design 
variables x1, £2,..., x. The x; can be variables fixed by the treatment design, such 
as the level of a canea: treatment, or they can be measured covariates, such as 
the age of the subjects. The x; can also represent categorical treatments, such as 
those in the meat storage experiment. 
The statistical model relates y to the x; through a set of parameters, 
Bo, 1, G2, --., Gx, such that it is linear in the set of parameters or 


Y¥ = Bo t+ fiai + Pot, +++ + Pete +e -) 


Some simple examples will help clarify how the investigator can develop a model 
unique to a particular study. 

An experiment measures the rate of a chemical reaction, y, as a response to the 
temperature, 7’, in the reaction chamber. The investigator may hypothesize that the 
rate increases linearly with temperature. Letting +; = 7’, the simple straight line 
equation describes the linear relationship as 


y=Pot+firi te 


which is the familiar simple linear regression model. If the investigator thinks the 
rate-to-temperature relationship may be quadratic, then let x; = T and x2 = T” and 
the model becomes 


y = Bo t+ Bir + foto +e 


How does the model work if the treatments are categories that cannot be repre- 
aes by metrical values for the z;? In this case the investigator can let the x; be 
indicator variables. The indicator variable does exactly what the name implies; it 
indicates the treatment group to which the observation belongs. One scheme for 
indicator variables lets x = 1 indicate that the observation belongs in a particular 
group and lets x = 0 indicate that it does not belong in a particular treatment group. 

The meat storage experiment has four treatments and the model can have four 


indicator variables, x1, £2, x3, and x4. They take the values 1 and 0, as follows 


_ f 1ifcommercial wrap 
= 0 otherwise 


= 1 if vacuum 
v2 0 otherwise 
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_ f lif mixed gas 
73 =) (otherwise 

_ f lifpure CO, 
“4 =) (otherwise 


The model may be written as 
Y = Pix, + Bore + f3x3 + B4r4 +e 


If the observation came from the commercial wrap treatment, with x; = 1 and 
Lg = 03 = = 0 ; 


y=f~ite 


If the observation came from the vacuum treatment, with z.=1 and 
= 73 = 2 = 0 


y= fate 


and so forth. Notice Gy = 0 since it is not necessary for a description of the obser- 
vations in this case. 

The model y = ( + e then describes the observation that has a value 6 + e if 
it comes from the commercial wrap treatment. Thus 6; = s4;, the mean of the hy- 
pothesized commercial wrap population as illustrated in Figure 2.2. 

The means model in Equation (2.1) comes directly from this representation by 
letting 6, = 11, Bo = 2, 63 = bs, and {4 = 4. Given that the 2; take only the 
values 0 or 1, and if we let 3; = yz;, then an observation from the commercial wrap 
treatment is modeled as | 


Yy =i Te 
By adding subscripts to identify specific observations 
Yij = fi + 1; eS ae eee. 


is the model representing observations from the commercial wrap treatment. Gener- 
alizing the model to include all treatment groups results in the model shown in 
Equation (2.1), or 


Yij = Mi + ei 
iio + ju 1,.2...4 


This model expression for experiments with treatment groups is more traditional 
and provides a more specific description for the experiment. 

The flexibility of the general linear model in Equation (2.2) allows the inves- 
tigator to include measured covariates in the model along with the treatment group 
variables. For example, suppose in the meat storage experiment the investigator 
was concerned that fluctuations in the moisture content among the meat samples 
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could alter the rate of bacterial growth. Prior to packaging the meat, measurements 
are made on the moisture content of each of the meat experimental units. 

If we let x = moisture content (%), then a term representing a linear relation- 
ship between bacterial counts and moisture content can be added to the model as 


Yij = Mi + Pay + ei; 


where z;; is the moisture content of the = steak in the zth treatment group and ( is 
the linear regression coefficient. This is the traditional analysis of covariance 
model which assumes the regression coefficient is the same for all treatment 
groups. The analysis of covariance is discussed in Chapter 17. 


2.5 Estimation of the Model Parameters with Least Squares 


The least squares method is perhaps the method most frequently used to estimate 
the pena! for the linear model. The least squares estimates are ctl rasdellUiaoee sof 


ec squares estimators are ues wie minimum me Random ampline 

for observational studies and randomization in experiments provides insurance for 
the independence assumption as discussed 1 in Chapter 1. Methods to evaluate the 
homogeneous variance assumption are found in Chapter 4. 


Estimators for the Full Model 


la n means @;. Use — | - wr ‘ ie ACC ; ql Yij @ 

nly knov tities. If we inet he ee samates aes cat hes Li tet the 

full eeu as : Ti, fen the estimators of the experimental errors are €;; = yi; — H,. 
The minimum sum of squares is 


: t Tr t T 
SSE=) > =) > Gy a) (2.3) 
i=1 j=l i=1 j=l 


SSE is the estimated sum of squares for experimental error, and it is a measure of 
how well the model fits the data. 

A method of differential calculus is used to determine the estimators 2, that 
minimize the sum of squares 


Q = Def, = 2 (yij — pi)? 


fe . ed for the estimators. The 

convent | > eq ; the normal equations; however, the designa- 
tion a nothing to do with the normal ee haat d distibution. 

For ¢ treatment groups with r replications per treatment there are ¢ normal 

equations, one for each treatment mean. The normal equations are found by first 
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differentiating 


yy» (yi3 — Hs) 


with respect to each of the jz; and setting the result equal to zero. The partial deriva- 
tive for a typical equation is 


or 


where y;. is the total of the observations for the ith treatment.! 


The solution for the least squares estimator of a treatment mean [4; is 
} 


B= =F, 1=1,2,...,1 (2.4) 


It turns out that the estimators of the treatment population means based on the least 
squares criteria are the observed treatment group means. 
The normal equations for the meat storage study are 


37), =22.44 & Tet *) bacterin cout tofal 


371, = 16.50 
33 —= 21.78 
314 — 10.08 


! The dot notation is used to simplify the presentation of sums. The total of the observations for the ith 
treatment is denoted by y; with the dot indicating that all observations in the ith treatment group have 
been summed to give this total or 


a 
Yi. = So uj =yatyate + yer 
=I 


Also the total of all the observations is denoted by y., with the two dots indicating that summation has 
been completed for both subscripts or 
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The least squares estimates for the meat storage experiment are 


22.44 


ele 

Hy 3 

eee 

Ho = 3 Sf gee 

=. zie 

fig ee SE ae 9-96 
3 

fe 10.08 

fi, = —— = 3.36 


The sum of squares for 2 alone: error for the model y;; = yu; + e;; is 


SSE = 3 Yo 2) - ‘ Se (2.5) 
j=1 


Notice that SSE is the pooled sum of squares from within each of the treatment — 
groups. The sample variance for the ith treatment group is _ 


(Yiz — Vi)” 
= B (r—1) 


and it is an estimate of o? from the data in the ith treatment group. © 
If we can make the assumption that o” is homogeneous—that is, the same for 
all treatment groups—then 


4 t as S ° 


t(r — 1) ~ €(r—D vs) 


is a pooled estimate of o? from all the data in the experiment. The computed sums 
of squares and variance for the meat storage experiment are shown in Table 2.3. 


Estimators for the Reduced Model 


‘means, then the simpler 

or reduced model y;; = uw + The least squares esti- 
aide ey ee u ee the ied mana is alia pone: mean A all the observations in the 
experiment, 


(2.7) 


where NV = rt. 
The minimum sum of squares for TPS error som the reduced model 


under the null hypothesis is 


50 CHAPTER 2 GETTING STARTED WITH COMPLETELY RANDOMIZED DESIGNS 


Table 2.3 Observations, means, and within groups sums of squares for the meat 
storage experiment 


Commercial Vacuum CO. Gy CO) 
7.66 5.26 TAL 3.5] 
6.98 5.44 7.33 2.91 
7.80 5.80 7.04 3.66 
Dh, = Vi. 7.48 550. - 7.26 3.36 
S "Wii - Ti)” (02.3848 0.1512 0.0758 0.3150 
j=l 
SSE = 0.3848 + 0.1512 + 0.0758 + 0.3150 
Soe. 05205 eee 
= 5 = —— = 1585 — 
MoE hod). 40) = 
; r t Tr 
SSE. = oe = oe 0 (2.8) 
i=l j=l i=l j=l 


and 


SSE, = (7.66 — 5.90)? + --- + (3.66 — 5.90)” = 33.7996 


2.6 Sums of Squares to Identify Important Sources 
of Variation 


clarify and sen ane results a ihe: eieritibat “The computed experimental error 
sums S — = = two — of the meat packaging data were quite different. 


= pr : md SSE, = BE, = 3 33.7996 for the reduced model with only one 
populati n mean Wij 


I ‘parts from the r ‘model. The lifierence ance the 
eeoritiens an their chee group means //; , ena in Table 2.4, are with one 
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Table 2.4 Observed values, estimates, and deviations of observed values from 
estimates with the reduced model and the full model 


Reduced Model Full Model 
Yig = M+ Gi; Yij = Mi + Cig 
Observed Estimate Difference Estimate Difference 
Treatment Y mn (Yij — LB) Li; (yij — L;) 
Commercial 7.66 5.90 1.76 7.48 0.18 
6.98 5.90 1.08 7.48 — 0.50 
7.80 5.90 1.90 7.48 0.32 
Vacuum 5.26 5.90 — 0.64 5.50 — 0.24 
5.44 5.90 — 0.46 5.50 — 0.06 
5.80 5.90 — 0.10 5.50 0.30 
CO, O2, N — 7.41 —- 5.90 12 =. 726 Oe 
fe 5.90 1.43 7.26 0.07 
7.04 5.90 14. 2b 7.26 — 0.22 
100% CO, 3.51 $Me bas HQZ9 3.36 0.15 
2.91 5.90 — 2.99 3.36 — 0.45 
3.66 - 5.90 — 2.24 3.30 0.30 
Por, = as 1e 90 SSE; = 0.9268 


exception in the vacuum treatment, less than the differences between the 
observations and the grand mean, pj = 5.90, estimated from the reduced model. 
The two experimental error sums of squares and their difference follow. 


t r 
Reduced Model: SSS) (yy - 7) = 35196 
i=l j=l 
t r 
Full Model: SSE;=)_> >) ig — 3) = 0.9268 
| i=l j=l 


pe: vo J of ms > ee y, _ Es 3 yw Vi 


i=1 j=l i=1 j=1 


Difference: 


Pe aa ro es 


= as y YY; gi Y. ) a q ect emia WS Lis ced 
=] = ==] ines ? 


= 32.8728 


The difference sum of squares is the sum of squared differences between si 
treatment jal means Ui me = grand mean v.. : 3 


. The total sum of squares nee the eeuaicie SSE,. 
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at 


because it is the sum of squared differences between all observations and the grand 
mean y . 


el. Therefore, we have the relationship 


t r 


SS wi- = G7 tt 3) we- a) @9) 


t=1 j=l 1=1 j=1 t=1 j=l 


or 


SS Total = SS Treatment + S'S Error 


-e SS Treatment—the sum of squared differences between the treatment 
group means and the grand mean 


e SS Error—the sum of squared differences between the observations within 
the group and the group mean 


The sum of squares formulae may be derived from an identity for the deviation 
of any observation from the grand mean. The equation 


(yy —9.) = @&, -— 9.) + Wy -— Fi) (2.10) 


partitions the deviation of any observation from the 


grand mean into two parts. 


Beare a measure ia the eres error mesnied with the seeaaton. Squar- 
ing and summing both sides of the expression in Equation (2.10) results in 


t * 
> > ee ay 5G =F. HSE Ui)” 


#21 .j=1 i=1 j=l i=] j=! 


r 


me ese s ae 
ye wa ¥) = = :2 ee ee SS > Wy —%,) 
i=l j= 


i=1 j= i=1 j=1 


is identical to the sum of squares partition shown in Equation (2.9). 
A summary of the formulae for the sums of squares equivalent to the definition 
formulae of Equation (2.9) is_ 
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SS Total = Ss s (yi; -9.) 


Fa 
| t 
S'S Treatment = r 3 y,-yy 
i=1 


S'S Error = S'S Total — SS Treatment 


2./ A Treatment Effects Model 


. pebhe acim means neal can be cannes 
differently to oe be eave on treatments on the experimental units. Equation 
(2.10) may be rearranged slightly to express the observation as a function of the 
grand mean and the two deviations as 


vig = 9, + Gi V+ Wis — Fi) (2.11) 
An equivalent population model may be written as 
= B+ (wi — B+ (yg — 1) (2.12) 


where 7 = Df y;/t is the average of the population means for the cell means model 
Yi = Mi + ei. 


, and the model in Equation oe 12) is often written as 
SEE inne a8 


where » = 7, and e;; = (yj; — 4). The several expressions for the 
treatment effects are shown in Display 2.2. A graphical representation of treatment 
effects for the meat storage experiment is shown in Figure 2.3. 


Display 2.2 Treatment Effects 


Treatment 1 Treatment 2 oe Treatment t 


. Saupe G9) Ge) 2 7) 
bi Population (ui-—f) . (ue—-7) ee (Hi — Z) 
Effect T| T2 ee Tt 
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——_—_ — Tt, —_—_—_> 


Ae Tg 


4 Mo ph. Bz by 


Figure 2.3. A graphical representation of treatment effects 


The model in Equation (2.13) has (¢ + 1) population parameters, which are 
Ul, 71, 72,°**,T. AS a consequence of the definitions for treatment effects their sum 
is equal to zero: 
: 


(2.14) 


2.8 Degrees of Freedom 


There were ¢ parameters (11, /42,---, 4) estimated for the full model. Conse- 
quently, the number of statistically independent elements in the error sum of 
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squares for the full model is (NV — 1), 


The treatment sum of squares is determined from a difference between the 
SSEs for two models as 


S'S Treatment = SSE, — SSEs 


Thus, (¢ — 1) degrees of freedom are associated with the sum of ees reduction 
due to treatments. 


2.9 Summaries in the Analysis of Variance Table 


A= 


The analysis of variance table summarizes our RUIEGES apo = in = 
oo — the senie meee | : 


-as the mean squé< EB) The ee mean — of 
inipORARRE is Sakic mean square © for treatments (M ST), computed as 


SS Treatment 
=1) 


The sum of squares partitions, degrees of freedom, and mean squares are sum- 
marized in an analysis of variance table like that shown in Table 2.5. 


Msz = 


Table 2.5 Analysis of variance table for a one-way treatment classification in a 
completely randomized design 


Source of Degrees of Sum of Mean 
Variation Freedom Squares Square 
Total N-1 SS Total 

Treatments . t-i1 SS Treatment MST = a 
Error N-t S'S Error MSE = ooe 
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% The MSE is an unbiased estimate of the experimental error variance 07; that 
is, the expected value of MSE is equal to 0”, or ‘ : 


(2.15) 
The expected value of MST’ is 


(2.16) 


where 


E(MSE) a and 4 E(MST) a oe renin 2A 2 ‘of this chapter. 


& 


2. 10 Tests of ey parnesee About Linear Models © 


The complete analysis ~ variance table for the meat storage pascal is — in 
Table 2.6. 1 sum 


yn i Tin ether eines vareatien due to eatments is senifitantty 
greater than Ando Epoiieniel error requires a test of hypothesis. 


Table 2.6 Analysis of variance for log(count/cm*) of psychrotrophic bacteria 
from the meat storage experiment 


Source of Degreesof Sumof Mean : 

Variation Freedom Squares Square Fr ifs a 
Total 1] 33.800 
Wrap 3 32.873 10.958 94.58 .000 
Error 8 0.927 0.116 


EERE EEE nner 


We could use a —— test vaseu on the a of variance pall 


vanies to Fev RIE the aSantoniOn of reilly distributed observations are found 
in Chapter 4. 
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A Sums of Squares Difference to Compare Two Models 


1 data, and SSE, is 


(SSE, — SSEs) 
SSE; 


The F Statistic to Test a Model Hypothesis 


m statistica y, it is known that the sums of squares of normally distributed 
random variables are associated with the chi-square distribution. It can be shown 
‘fut t SSE, /o* is distributed as the chi-square variable with (NV — t) degrees of 
freedom. The difference SST = SSE, — SSE; with (t — 1) degrees of freedom 
represents the reduction in the total sum of squares due to differences among the 
treatment means. When the treatment means are equal, 4; = 2 = --- = pz, it can 
be shown that this criterion, (SSE, — SSE;)/a? = SST'/o?, also has a chi-square 
distribution with (¢— 1) degrees of freedom and that it is independent of the 
distribution of SSE; /o. 
The ratio 
p_ SSE, ~ SSE 1) 


eee cee ae (2.17) 


is the ratio of two chi-square distributions each divided by their respective degrees 
of freedom. Under the null hypothesis of no difference between treatments the ratio 
has the F distribution with (¢t — 1) and (NV — t) degrees of freedom, respectively, in 
the numerator and denominator. 

The test statistic computed from the analysis of variance table to test the null 
hypothesis Ho: py = po = --- = py IS 


ea (2.18) 


which has the F distribution when the null hypothesis is true. 
The expected mean ieee in Equations (2.15) and (2. 16) showy that M SE is 
an RNB DIaES Psa’ of o? under either model hypothesis. But, MS 


. y, the expec ne “i He RGF ea a statistic 
will be greater than a af the denominator. Large values of Fo would suggest 
rejection of the null hypothesis in favor of the alternate. 

A one-sided, upper-tail critical region is implied for the hypothesis test. The 
null hypothesis Ho is rejected for a Type I error probability of a if 
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P06 an iN) 


where Fy = MST’ /MSE and Fy, (¢-1)w-1) is the value of the F’ distribution that 
is exceeded with probability a. Critical values of the F distribution are found in 
Appendix Table IV. } 

A test of the hypothesis of no differences among the four meat packaging treat- 
ments with respect to growth of bacteria in the meat storage experiment is illus- 
trated in Display 2.3. From the analysis of variance (Table 2.6) the required mean 
squares are MST’ = 10.958 with 3 asta of freedom and MSE = 0.116 with 8 
degrees of freedom. 


Display 2.3 Hypothesis Test for the Meat Storage Experiment 


Ao : pf = 2 = 3 = bl 
H, : wi ~ px for at least ones # k 


a=.05 Critical Region: fo > F'o5,3,.3 = 4.07 


MST _ 10.958 
= 045 
ES 116 ° 


Since Fy = 94.58 falls in the critical region, Fo > 4.07, we reject the null 
hypothesis and conclude that the treatments differ with respect to the number of 
psychrotrophic bacteria observed on the meat stored under the different conditions. 


2.11 Significance Testing and Tests of Hypotheses 


A common practice in hypothesis testing is to report the significance level of a test 
statistic. The significance level of a test statistic is the probability of exceeding the 
value of the test statistic under the null hypothesis condition. Note the column 
labeled Pr > F in Table 2.6. The value .000 in that column is the probability that 
the F statistic with 3 and 8 degrees of freedom will exceed Fo = 94.58 and often is 
referred to as the “P-value.” Since the reported value is .000, we know the proba- 
bility of exceeding Fo = 94.58 is less than .0001, or P < .0001. The magnitude of 
the P-value is used by many investigators to decide the statistical significance of 
the F test in the analysis of variance. The value is frequently reported in a discus- 
sion of results. For example, the F’ test for the present example may be reported as 
“significant at the P < .0001 level of significance.” If the probability value is less 
than the traditional significance levels of .01 or .05 the null hypothesis will be re- 
jected because the observed Fp statistic is in the critical region. 

Most analysis of variance computer programs include this probability value for 
the F statistic in the printed output. A technical discussion of the P-value and its 
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relationships to the number of replications and to the alternative hypothesis can be 
found in Hung et al. (1997) and references cited therein. 


2.12 Standard Errors and Confidence Intervals 
for Treatment Means 


In Section 2.5 it was determined that the least squares estimators of the population 
means were the observed means of the treatment groups y;. The observed means 
are averages of r independent observations, so that the variance of a treatment 


group mean is oz = o*/r. The estimator of the variance is 


gig. te 2.19 
ae (2.19) 


Vi. 
where s* = MSE from the analysis of variance. The estimator for the standard er- 
ror of the mean is 


a, = 4) — (2.20) 


A 100(1 — a)% confidence interval (CI) estimate is constructed for each of the 
treatment group means with upper and lower limits, respectively, where ty /2,(—t) 1S 
the Student f statistic exceeded with probability a/2 and (NV — t) are the degrees of 


freedom for MSE: 
Yi, t tas2(n—t)(Sy,) and Y, — tayeyn—z(Sz,) (2.21) 
The standard error of the mean for any treatment group in the meat storage 
study is | 
0.116 
a 0.197 


and t.925,3 = 2.306. The treatment group means are shown in Display 2.4 with the 
95% CI estimates. 


Display 2.4 Means, Standard Errors, and 95% Confidence 
Intervals for the Meat Storage Experiment 


Treatment Mean _ Standard Error 95% CI (Lower, Upper) 


Commercial TAB 0.197 T.A8 £ 0.454 (7.03, 7.93) 
Vaccum 5.50 0.197 5.50 + 0.454 (5.05, 5.95) 
CO.0,, NV 7.26 0.197 7.26 + 0.454 (6.81, 3.81) 
100% CO, 3.36 0.197 3.36 + 0.454 (2.91, 3.81) 
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For any group mean the 95% CI is 
y, +(2.306)(0.197) or YY; £0.454 


2.13 Unequal Replication of the Treatments 


Unequal replication may occur because some experimental units were lost during 
the study; there were insufficient numbers of subjects available for all study groups; 
or collected data were lost, destroyed, or invalid. The most visible effects of un- 
equal replication appear in the computations required for the analysis. A conse- 
quence of greater concern is the ensuing unequal information on all treatment 
groups. The loss of observations from any treatment group results in a proportional 
loss in precision on the estimates of the treatment group mean relative to those 
treatment groups with complete data. 


Example 2.2 Detection of Phlebitis in Amiodarone Therapy 


Phlebitis is an inflammation of a vein that can occur when intravenously ad- 
ministering drugs. The active drug was thought to be the main contributing 
factor to inflammation, although the vehicle solution used to carry the drug 
through intravenous administration could be a possible contributor. 


Research Hypothesis: Of particular importance to the investigators was the 
problem of detecting the onset of phlebitis. This particular study was to ex- 
plore mechanisms for early detection of phlebitis during Amiodarone therapy. 
They hypothesized that tissue temperature changes near the intravenous ad- 
ministration would be an early signal for the impending inflammation. 


Treatment Design: Three intravenous treatments were administered to test 
animals. They were 


The saline solution served as a placebo control treatment to determine wheth- 
er the act of administration alone affected inflammation. The vehicle solution 
served as a control to separate any effects of the vehicle from the effects of 
the drug. 


Experiment Design: Rabbits used as the test animals were randomly assigned 
to the three treatment groups in a completely randomized design. An intra- 
venous needle was inserted in a vein of one ear of the rabbits using one of the 
three treatments. 
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An increase in the temperature of the treated ear was considered as a 
possible early indicator of phlebitis. 7 


‘ Compli- 
cations with the experimental protocol resulted in a different number of rab- 
bits for each of the treatment groups. The observed temperature differences at 
4.5 hours for each of the rabbits remaining in the study are given in Table 
Se | 


Table 2.7 Ear temperature (°C) differences, treated minus untreated, of rabbits 
4.5 hours after treatment 


Amiodarone Vehicle Saline 

ZZ 0.3 0.1 

1.6 0.0 0.1 

0.8 0.6 0.2 

1.8 | 0.0 — 0.4 

1.4 — 0.3 0.3 

0.4 0.2 0.1 

0.6 0.1 

Be — 0.5 

0.5 
‘7 9 6 8 Ps > Y= 23 

i=l 

total (y;.) 10.80 0.80 0.00 y = 11.60 
mean (¥; ) 1.20 0.13 0.00 y =0.50 


Source: G. Ward, Department of Pharmaceutical Sciences, University of Arizona. 


The Linear Model for Unequal Replications 


The cell means model for the one-way treatment classification in the completely 
randomized design with unequal replication is 


Yig = Mi + ij (2.22) 
ee Fe ee toe se eee = 


where r; is the number of replications for the 7th treatment group. The total number 
of observations is NV = }ir;. The interpretation and assumptions are the same as 
those for the equal replication model. 

The least squares estimator for treatment means determined by the methods 
outlined in Section 2.5 are 


“2p, =— toe 1, 2. 4nSt (2.23) 


or the observed treatment group mean, as for equal replication. 
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The Analysis of Variance for Unequal Replications 


The sums of squares partitions for the analysis of variance are shown in Table 2.8. 
Each element of the sum of squares for treatments is a weighted square of the 
deviation of a treatment mean from the grand mean, r;(y,; — 7_)?. The weight, r;, is 
the number of replications for the treatment group. The weights reflect the amount 
of information available for the estimation of the treatment means. The sum of 
squares for experimental error is the pooled sum of squares within the treatment 
groups. The expected mean square for treatments includes a weighted sum of 
squares of the treatment effects 7; = (u; — 7), where Z = Ui rjp;/N. With un- 
equal replication, our definition of the treatment effects means their weighted sum 
is equal to zero, Li ri(u; — 7) = 0. 


Table 2.8 Analysis of variance for a completely randomized design with unequal 
replication of treatments 


Source of Degrees of Sum of Mean Expected 
Variation Freedom Squares Square Mean Square 
Total N-~t 3 fe og) 
i=1 j=1 
t 
Treatments t—1 ri(y; -y.y° MST o* +6? 
i=1 
t Ti 
Error N-t > wii - ¥,)” MSE o 
i=1 j=1 


t 
0 = ay 2d rile —By 


The analysis of variance for the Amiodarone study is shown in Table 2.9. The 
null hypothesis of no differences among the treatment means is tested with the 
statistic AM MST /MSE = 3.6081/0.2177 £16.58. The F statistic is found in 
the column labeled “F” in Table 2.9. The null hypothesis is rejected at the .05 level 
of significance because Fo > F522 =3.490r notice that Pr > F = .000 in 
Table 2.9. 


Table 2.9 Analysis of variance for ear temperature differences from the Amioda- 


rone study 

Source of Degrees of Sum of Mean 

Variation Freedom Squares Square F Pr>F 
Total 22 11.5696 

Treatment 2 7.2162 3.6081 16.58 .000 


Error 20 4.3533 G20 
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Standard Errors for Treatment Means and Unequal Precision 


The standard error of a treatment group mean is estimated by 


MSE 


oy, 


(2.24) 


<I 
| 
25 
S. 


O.Z177 
87, = 9 = 0.156 
02177 
8, = { — — = 0.190 
and 
: 0.2177 


The differences among the standard errors of the three treatment group means 
illustrate the decreased precision when data are lost from an experiment. For exam- 
ple, the vehicle treatment group with six observations has a standard error 22% 
greater than the Amiodarone treatment group with nine observations. 


2.14 How Many Replications for the F Test? 


In Chapter | the number of replications required to detect some predetermined dif- 
ference between two treatment means was ascertained with the normal z statistic in 
conjunction with knowledge of the variance, significance level, and power of the 
test. The required number of replications is determined in this section by a method 
based on the F statistic. 

The power of a test of hypothesis is the probability of rejecting a false null 
hypothesis. The statistic Fy = MST'/MSE is used to test the null hypothesis Hp: 
T; = 0. The power of the test is 1 -@ = P(F > Fyy,,|Ho false), where 1 and 1» 
are the numerator and denominator degrees of freedom, respectively. When Hp is 
false, fo has the non-central F distribution with vy; and v2 degrees freedom and 
non-centrality parameter \ =r ey T?/o7. If the null hypothesis is true, then the 
non-centrality parameter has the value \ = 0, since all 7; = 0 and Fp has the cen- 
tral F distribution. 

Tables have been computed that tabulate the power of the F test for given 
values of the significance level a; power 1 — 3; degrees of freedom v; and v2; and 
®, a function of the non-centrality parameter, which is 
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(2.25) 


Charts of these power curves are found in Appendix Table IX for selected 
parameter values of the non-central F' distribution. 

The charts are used to estimate the required number of replications for given 
values of a, 1 — 8, 07, 4, 2, and ®. A value for ® requires a value for o” and 
specific values of the treatment means that lead to rejection of the null hypothesis. 
Given 11, {2,---, pz, the 7; = (u; — ZZ) are evaluated for ® in Equation (2.25). 


Example 2.3. Replication Number for the Amiodarone Study 


Suppose for future experiments the investigator with the Amiodarone study in 
Example 2.2 was interested in rejecting the null hypothesis with a power of at 
least .90 at the .05 level of significance if the ear temperature difference for 
the drug treatment group was 0.8°C while the vehicle and saline means were 
0.1°C and 0°C, respectively. 


Ti =p —-p =0:5—-0.3=05 
T =p, -f =01—-03= —02 
73 =p3—-p = 00-03 = —0.3 


Therefore, ae T? = 0.38, and we can use MSE = 0.22 from the analysis of 
variance as an estimate for o*. From Equation (2.25) evaluate 


2 

a ee 
a Pay Tap agy as 
to? 3(0.22) 


The required values for the power curve are 4,=(f—1)=2, 
vy = t(r—1)=3(r—1), and a=.05. Using r=5 as a first trial value 
yields @ = a9 = |.7 and v. = 12. From Appendix Table IX the power of 
the test is approximately .65, which is less than the required .90. Increasing 
the replication number to r = 9 yields ® = ,/5.22 = 2.3 and 12 = 24, witha 
resulting power in excess of .90. It appears that nine rabbits will be required 
in each treatment group to yield a power of at least .90. 


It is difficult to specify desired effects for a complete set of treatments. It may 
be easier to specify the difference between any two treatment means that would be 
physically or biologically significant. Suppose it is desired to detect significance at | 
a difference of D = pi; — y;. The minimum value of ®? in that case is 
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B. rD? 


= (2.26) 


Comments: Commercial computer programs are available to estimate replication 
numbers with considerable ease, although the investigator must still have available 
estimates of 07, desired power, significance levels, and magnitude of effects to de- 
clare significant. 

Estimates of o? ordinarily are difficult to declare. If results are available from 
previous experiments similar to the one contemplated, then variance estimates from 
those experiments may be pooled to attain a reasonable value. 

When the entire experiment cannot be performed at one time or location with 
the required number of replications, the experiment can be repeated several times 
or at several locations. For example, if eight replications are required and they can- 
not be performed at one time, then perform the experiment twice with four replica- 
tions each time. | 


EXERCISES FOR CHAPTER 2 


1. A traffic engineering study on traffic delay was conducted at intersections with signals on urban 
streets. Three types of traffic signals were utilized in the study: (1) pretimed, (2) semi-actuated, and 
(3) fully actuated. Five intersections were used for each type of signal. The measure of traffic delay 
used in the study was the average stopped time per vehicle at each of the intersections 
(seconds/vehicle). The data follow. 


a ° 8 


Pretimed Semi-actuated Fully actuated 
36.6 17.5 15.0 
39.2 20.6 10.4 
30.4 18.7 18.9 
37.4 27 10.5 
34.1 22.0 1572 


Source: W. Reilly, C. Gardner, and J. Kell (1976). A technique for 
measurement of delay at intersections. Technical Report FHWA-RD-76- 
135, Federal Highway Administration, Office of R & D, Washington, 
De. 


Write the linear statistical model for this study, and explain the model components. 

State the assumptions necessary for an analysis of variance of the data. 

Compute the analysis of variance for the data. 

Compute the least squares means of the traffic delay and their standard errors for each signal 


type. 


Compute the 95% confidence interval estimates of the signal type means. 
p yp 
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f. Test the hypothesis of no difference among the mean traffic delays of the signal types with the 
F test at the .05 level of significance. 
g. Write the normal equations for the data. 


2. An experiment was conducted to test the effects of nitrogen fertilizer on lettuce production. Five 


rates of ammonium nitrate were applied to four replicate plots in a completely randomized design. 
The data are the number of heads of lettuce harvested from the plot. 


Treatment (lb N/acre) Heads of lettuce/plot 


0 104,114, 90, 140 
50 134, 130, 144, 174 
100 146, 142, 152, 156 
150 147, 160, 160, 163 
200 131, 148, 154, 163 


Source: Dr. B. Gardner, Department of Soil and Water 
Science, University of Arizona. 


Write the linear statistical model for this study, and explain the model components. 

State the assumptions necessary for an analysis of variance of the data. 

Compute the analysis of variance for the data. 

Compute the least squares means and their standard errors for each nitrogen level. 

Compute the 95% confidence interval estimates of the nitrogen level means. 

Test the hypothesis of no difference among the means of the nitrogen levels with the F test at 
the .05 level of significance. 

Write the normal equations for the data. 

This experiment was conducted in a completely randomized design with the field plots in a 
4 x 5 rectangular array of plots. Show a randomization of the five nitrogen treatments to the 
20 plots using a random permutation of the numbers | through 20. 


mhooaogp 


one 


3. An animal physiologist studied the pituitary function of hens put through a standard forced molt 
regimen used by egg producers to bring the hens back into egg production. Twenty-five hens were 
used for the study. Five hens were used for measurements at the premolt stage prior to the forced 
molt regimen and at the end of each of four stages of the forced molt regimen. The five stages of 
the regimen were (1) premolt (control), (2) fasting for 8 days, (3) 60 grams of bran per day for 10 
days, (4) 80 grams of bran per day for 10 days, and (5) laying mash for 42 days. The objective was 
to follow various physiological responses associated with the pituitary function of the hens during 
the regimen to aid in explaining why the hens will come back into production after the forced molt. 
One of the compounds measured was serum T3 concentration. The data in the table are the serum 
T3 measurements for each of the five hens sacrificed at the end of each stage of the regimen. 
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Treatment , Serum T 3, (ng/dl) x 107! 


Premolt 94.09. 9045, ..99:38,- »-7356,-:-74,39 
Fasting 98.81, 103 85e: 1$23,-:1290624119-61 


60 g bran 197 A Bypah2O7 3h V17 SO; 22608; 222.74 
80 g bran 102.0370 Bigs 41997, 3411201 TOT AO 
Laying mash 83.14, 89.59, 87.76, 96.43, 82.94 


SSE a RES ST SE a es SS 
Source: Dr. R. Chiasson and K. Krown, Department of Veterinary Science, Uni- 
versity of Arizona. 


Write the linear statistical model for this study, and explain the model components. 

State the assumptions necessary for an analysis of variance of the data. 

Compute the analysis of variance for the data. 

Compute the least squares means and their standard errors for each treatment. 

Compute the 95% confidence interval estimates of the treatment means. 

Test the hypothesis of no differences among means of the five treatments with the F test at the 
.O5 level of significance. 

Write the normal equations for the data. 

This experiment was conducted in a completely randomized design with one hen in each of 25 
pens. Show a random allocation of the five treatments to the 25 pens with a random permuta- 
tion of the numbers | through 25. 


eee oe, ae on 


aille S 


Data were collected on student teachers relative to their use of certain teaching strategies that had 
been presented to them in preservice education. There were 28 student teachers who had learned to 
use the strategies (9 in 1979, 9 in 1980, and 10 in 1981). In 1978 there were 6 teachers who did not 
learn to use the strategies, and they were used as a control group. The investigator recorded the 
average number of strategies used per week by each of the student teachers during their student 
teaching assignments. The investigator wanted to know whether the number of strategies used by 
the student teachers was different among the years. 


Average Number of Different Strategies Used 
Control 1978 1979 1980 198] 


6.88 tiz0- 1085 7.29 
5.40 10.50 7.43 14.38 
16.00 8.43 6.71 6.00 
9.80 8.63 7.60 5.00 
7.63 8.63 7.60 5.38 
5.00 fuer” Dot 14.14 


lias S71 pie 
P25. 9.80 a 8 | 
[Use 7.20 7.32 
1.75 


re S 
Source: Dr. A. Knorr, Family and Consumer Resources, 
University of Arizona. 
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mooaoge 


g. 


5. In a particular calibration study on atomic absorption spectroscopy the response measurements 
were the absorbance units on the instrument in response to the amount of copper in a dilute acid 
solution. Five levels of copper were used in the study with four replications of the zero level and 
two replications of the other four levels. The spectroscopy data for each of the copper levels are 


Write the linear statistical model for this study, and explain the model components. 
State the assumptions necessary for an analysis of variance of the data. 

Compute the analysis of variance for the data. 
Compute the least squares means and their standard errors for each treatment. 

Compute the 95% confidence interval estimates of the treatment means. 

Test the hypothesis of no differences among means of the four treatments with the F test at the 
.05 level of significance. 

Write the normal equations for the data. 


given in the table as micrograms copper/milliliter of solution. 


de cot See oy 


- ee 


6. Consider the experiment in Exercise 3. Suppose some of the chickens were lost during the course 


Copper (mg/ml) 


0.00 0.05 0.10 0.20 0.50 
0.045 O084sc O49 ds: POSS - 0.395. 


0.047 0.087 - 0416 OCP 3: 0.399 
0.051 
0.054 


Source: R. J. Carroll, C. H. Spiegelman, and J. Sacks 
(1988), A quick and easy multiple-use calibration-curve 
procedure, Technometrics 30, 137-141. 


Write the linear statistical model for this study, and explain the model components. 
State the assumptions necessary for an analysis of variance of the data. 

Compute the analysis of variance for the data. 
Compute the least squares means and their standard errors for each treatment. 

Compute the 95% confidence interval estimates of the treatment means. 

Test the hypothesis of no differences among means of the five treatments with the F test at the 
.05 level of significance. 

Write the normal equations for the data. 
Each of the dilute acid solutions had to be prepared individually by one technician. To prevent 
any systematic errors from preparation of the first solution to the twelfth solution, she prepared 
them in random order. Show a random preparation order of the 12 solutions using a random 
permutation of the numbers | through 12. 


of the experiment, resulting in the following set of observations. 


af o 2 


g. 
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Treatment Serum T3, (ng/dl) x 107! 
Premolt 94.09, 90.45, 99.38, 73.56, 
Fasting 98.84, 0803 39. 115.23, 129.06, 117.61 


60 g bran LOT BS 2O731; ~ 17750, 
80 g bran HOS, So eee er, TOF TO 
Laying mash 82.94, 83.14, 89.59, 87.76, 


Write the linear statistical model for this study, and explain the model components. 

State the assumptions necessary for an analysis of variance of the data. 

Compute the analysis of variance for the data. 

Compute the least squares means and their standard errors for each treatment. How has the loss 
of some chickens from the experiment affected the estimates of the means? 

Compute the 95% confidence interval estimates of the treatment means. 

Test the hypothesis of no differences among means of the five treatments with the F test at the 
.05 level of significance. 

Write the normal equations for the data. 


Use the data from Exercise 3 to determine how many chickens the biologist would need for each 
treatment to reject the null hypothesis at the .05 level of significance with a power of .90 if the 
difference between the control treatment and any new treatment was 30 units of T3 concentration. 


Use the data from Exercise | to determine how many intersections the traffic engineer would need 
for each type of traffic signal to reject the null hypothesis at the .01 level of significance with a 
power of .90 if mean delays at the three traffic signal types were 20, 18, and 16 seconds, 
respectively. 


The following is a small exercise to help you understand how the least squares principle works to 
provide a minimum sum of squares for experimental error. 


a. 


Use Equation (2.3) and the data from Exercise 2.1 to compute SSF for the following cases: 

(i) Use the smallest observation in each treatment group for 4, in Equation (2.3) to compute 
SSE. 

(ii) Use the largest observation in each treatment group for jz; in Equation (2.3) to compute 
SSE. 

(ili) Use the mean of the observations in each treatment group for 1; in Equation (2.3) to 
compute SSE. | 

(iv) Substitute another value between the mean and the largest observation for 4, in Equation 
(2.3) to compute SSE. 

(v) Substitute another value between the mean and the smallest observation for 7; in Equation 
(2.3) to compute SSE. 

Plot SSE versus j4;, with your computed values of SS'E on the vertical axis and the 7, values 

on the horizontal axis. 

What is the value of SSF for each of the cases? Which case provides the smallest value of 

SSE? 
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10. One of the assumptions for the linear model we use to describe our data in the treatment groups 
(yi; = Wi t+ ei;), is that the observations were random, independent observations of the random 
variable Y. 


a. 


2A.1 


What would you have to do during the conduct of the study for the assumption of random and 
independent observations to be reasonable if 

(i) it was a designed experiment? 

(ii) it was a comparative observational study? 

Before you can call the least squares estimators minimum variance and unbiased estimators, 
what additional assumptions do you have to make about the model? 

Before you can compute confidence intervals and test hypotheses about the model, what addi- 
tional assumption do you have to make? . 

Suppose you could not make either of the last two assumptions (those in parts b. and c.). How 
would you be able to test the hypothesis of no difference among the treatment means? 


Appendix: Expected Values 


The expected value of a random variable is its average value. If a random variable 
Y has a probability distribution with a mean y and a variance o”, the expected val- 
ue of Y is defined as pp = E(Y), where E(Y) is read as “the expected value of Y.” 
The variance of Y is defined as o? = E(Y — 1)”, which is the expected value 
of the square of the difference between Y and the mean. 
If there are two random variables, Y; and Yo, and E(Y,;)=p, and 
E(Y2) = po, then the covariance between the two variables Y; and Y2 is defined as 


O12 = E[(% — t1)(¥%2 — H2)] 


The covariance indicates the relationship between Y; and Y. If large values of 
Y; are associated with large values of Y2, then the covariance is positive. If values 
of Y; become smaller as values of Y> become larger or vice versa, then the covari- 
ance is negative. If the values of Y; and Y> are independent, then the covariance is 
zero. 7 

The cell means model for the completely randomized design is 


Yij = Mi + ij (2.1) 


p42 ar per, Bee 


The experimental errors e;; are assumed to be independent random variables 
with a mean of zero, variance o”, and zero covariance between any two errors. 

The mean or expected value of any e;; is E(e;;) = 0, and the variance of any 
ei; is E(e?,) = 07. Since there is no covariance between the e;;, the expectation of a 
product between any two error terms in the same or different treatment groups is 
E(e;; + €mk) = 0, where 1 A m or 7 #k. The py; are the population means and are 
constants with respect to the expectation operation. The expected value of a con- 
stant is the constant value itself, or F(;) = pj. 
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The expected value of any observation described by the cell means model may 
be found by substitution of 4; + e;; for y;; in the expectation: 


E(yis) = E(u; + ej) = E(ui) + E(eij) = wi (2A.2) 


2A.2 Appendix: Expected Mean Squares 


The expected values are required for MSE and MST in the analysis of variance. 
With any number of replications for each of the treatments the mean square for 
experimental error is estimated as MSE = SSE/(N —t), where 


SSE = 3 » (is — Bi)” (2A.3) 
=] 3= 
The expectation of SSE is found with the substitutions y;; = ju; +e; and 
cee 
Y;, = i + &, into Equation (2A.3), where @;, = (—) a e;;. The resulting ex- 


pression is 


t t ri 
SSE = ae a “fo Ci = (CF “bh é;,)]? = 2 2, (e, “= é;.)° 
=| j= 


Ti 1 1 
Given E(é) = = E( oe?) = = Ele}, +e3, +--- +e?) = — o’, the expecta- 
i= a r 


2 j 
r; j= i i 
tion of SS'F is 
t tT 
E(SSE)=)— )_ Ee.) - ec; ) 
ee =] 
= No? — to? 
=(N —t)o’ 
and 
| E(SSE) _ 9 
E(MSE 2A.4 
(MSE) = =" = (24.4) 


The mean square for treatments is ST’ = SST/(t — 1), where 


t 


SSF =) 1G, —-7Y  (2A,5) 


i=1 
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and the expectation of SST" requires the eae of ¥, = Mi fe and 
yY =p +€_ into Equation (2A.5), where € > Y e,;andzZ = — x ri 
1 j=1 


The resulting expression is 


t 
SST =) rilwte,—# ee 
i=1 
t 


t t 
= riui- BY + rie. -— 2. +2) rie -— BG. - 2.) 
ta] 14 =] 
Expanding the second term of the last expression yields 
t | t 

S. ri(€, — €,)° = a re, — N@ 

i=1 t=1 
Given 


] 3 | 1. 
Po 6 Be Re ee E(e?, + e7, +--+: + e2, + crossproducts) = al 


I 
and E(é?) = — o?, the expectation of SST is found as_ 
r% 


ris — B) + one :) — NE?) 


ot 


E(SST’) = 
1 


@-. 
+ || 


Ti(Mi — qi.) aie" — 27 


». 
+ || 
_ 


Tipe — —) +> 1)o? 
i=1 


The expectation of MST" is 
ECS ST) _ ihe 
U = I) ‘i=l 


The expected value in Equation (2A.6) can be expressed in terms of the treatment 
effects by the substitution 7; = Hi — 7p. ). 


If all r; = r, then let 0? = = (ui - —f)?(t — 1) and 


E(MST) = any (2A.6) 


E(SST) _ 


E(MST) = eoaiy 


o? +76? (2A.7) 


3 Treatment Comparisons 


The analysis of variance and least squares estimates of treatment group means 
provide the basic information necessary for an in-depth analysis of research 
hypotheses using methods introduced in this chapter. The methods for an in-depth 
analysis of the responses to the treatment design include planned contrasts among 
treatment groups, regression response curves for quantitative treatment factors, 
selection of the best subset of treatments, comparison of treatments to the control, 
and all pairwise comparisons among treatment means. All of these methods involve 
a set of simultaneous decisions to be made by the investigator. This simultaneous 
statistical inference affects statistical errors of inference. Some of those effects and 
the control of those errors are discussed in this chapter. 


3.1 Treatment Comparisons Answer Research Questions 


The relationship between research objectives and treatment design requires us to 
identify treatments relative to their role in the evaluation of research hypotheses. 
When an experiment is conducted to answer specific questions, the treatments are 
selected such that comparisons among the treatments will answer the questions. For 
example, specific questions can be answered from the meat storage experiment in 
Chapter 2 about the effect of different storage conditions on the growth of bacteria 
on meat during storage. The four treatments for the meat storage experiment were 
(1) commercial wrap, (2) vacuum, (3) mixed gases, and (4) pure CO2. The sum- 
mary statistics from the experiment are shown in Display 3.1. 
Questions that can be asked about meat storage conditions include - 


e Is the creation of an artificial atmosphere more effective in reducing bacte- 
~ rial growth than ambient air with commercial wrap? 


i 
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Display 3.1 Summary Statistics from the Meat Storage 
Experiment of Example 2.1 


Treatment 


Commercial Vacuum CO, O77 CO, 
5.50 7.26 3.36 
s* = MSE = 0.116 with 8 degrees of freedom 


o Are the gases more effective in reducing bacterial growth than a complete 
vacuum? 


‘e Is pure CO, more effective than a mixture of CO, Og, and N in reducing 
bacterial growth? 


3.2 Planning Comparisons Among Treatments 


Contrasts among treatment means can be constructed to answer specific questions 
formulated for the experiment. Contrasts are special forms of linear functions of 
observations (discussed in Appendix 3A). A contrast among means is defined as 


t 
C =>) kip = Bip + koa +--+ + hep (3.1) 


i=1 


t 
where = k; ae (1); 
it 
The appropriate contrasts for the three specific questions from the meat storage 
experiment are 


o— hd. — ho a ee T [b4 
f - Sw ate =) < 


© vacuum versus gases: 


e mixed gases versus pure CO:: 
Cs = ps = pa 
The first contrast is the difference between the mean of the commercial wrap and 


the average of the means for the other treatments. The sum of the coefficients 


d,- 5 _ , — ;) is zero, as it should be for a proper contrast. The second contrast 
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is the difference between the mean of the vacuum wrap and the average of the 
means for the gas treatments with coefficients (0,1, — + _ =)s while the third 
contrast is a difference between the means of the two gas treatments with 
coefficients (0,0,1, — 1). 

Estimates of the contrasts, standard errors, confidence interval estimates, and 
tests of hypotheses about the contrasts are computed from the observed data. Any 
contrast of the population treatment means, C' = Xi kjp;, is estimated by the same 
contrast of the observed treatment means as 


t 
a De key, = kiyy, +--+ + eG, (3.2) 


where ¥, is the mean of the zth treatment group. 
The estimates for the three contrasts in the meat storage experiment are 


3.90 


beeps 7.26 — 3.36 = 
meee el et Re ee niet iS ene Bi nat a 


Assessment of Contrast Estimates 


Standard Errors of Contrasts 


The variance of a contrast estimate c = 1 k;y;, is estimated with 


t 
| k? 45 k2 +H] 
2 2 i ey pretty t 
a i ag i 4 3.3 
Ss » |=: Bebe. - = (3.3) 


=] VT; T1 


where s? = MSE from the analysis of variance. If the r; are equal, the variance 
estimator simplifies to 


2 
32 = = [ki + hp +--+ + ke] (3.4) 


The estimator for the standard error of a contrast is the square root of the variance, 


Se = 4/82 (3.5) 


inves jg bi ay .116 and all rj; = 3 . The variance and 
Renderd ¢ error peereeny for the Te contrast a = 
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oa = “S14 (-5)'+(-5)'+(-5) | = 0.052 


and 


Se, = +/0.052 = 0.228 


The standard error estimates for the second and third contrasts are s,, = 0.241 and 
Sc, = 0.278. 


Interval Estimates for Contrasts 
The 100(1 — a)% confidence interval estimator for a contrast is 
CH tajo(n-t)(Sc) (3.6) 


The correct degrees of freedom for the Student ¢ statistic are the degrees of freedom 
for the variance used to compute the standard error of the contrast. The variance 
estimate for the meat storage experiment is MSE =0.116 with (N —t)=8 
degrees of freedom. The 95% confidence interval estimate for C, using t925,.3 = 
2.306 is | 


2.11 + (2.306)(0.228) 


The 95% interval estimates for the complete set of contrasts are shown in Table 3.1. 


Table 3.1 The estimates, standard errors, and 95% confidence interval estimates 
of three contrasts for the meat storage experiment 


Standard 95% CI 
Contrast Estimate Error Lower Upper 
Ci fokt 0.228 1.58 2.64 
Cy 0.19 0.241 Ua 0.75 
C3 3.90 0.278 3.26 4.54 


A Sum of Squares Partition for the Contrast 


A sum of squares is calculated for the treatment contrast to indicate how much 
of the variation in the data can be explained by that specific contrast. The sum of 
squares reduction for estimation of a contrast (C = 2) k;u;) can be computed from 
treatment means as 


If all r; are equal 
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t 2 
2 ki, 
SSC = — (3.6) 
d (kj/ri) 
7=1 
t 2 
SsC=r—= (3.7) 
dk; 
f=] 


There is 1 degree of freedom associated with the sum of squares for the contrast. 
The sum of squares partitions for the three contrasts in the meat storage experiment 
are shown in Table 3.2. The manual computations are illustrated in Display 3.2. 


Table 3.2 Analysis of variance with contrasts for log(count/cm?) of psychro- 
- trophic bacteria from the meat storage experiment 


Degrees of Sumof Mean 

Source Freedom Squares Square F Pr>F 
Total 11 33.80 | | 
Treatments 3 32.87 10.96 94.58 .000 
Error 8 0.93 O52 

Contrast DF Contrast SS Mean Square F re £. 
1 vs others 9:99 9.99 86.19 .000 
2 vs 3 and 4 l 0.07 0.07 0.62 453 
3 vs 4 ] 22.02 22.82 196.94 .000 


Hypotheses About Contrasts 


The usual null hypothesis states that the contrast has a zero value. For example, the 
null hypothesis for the second contrast of the meat storage experiment would be 


— pecs slis Roper cans peice? i eeeeg: sgh ise z Ss Se Lee 
e bacteriai growth for the two gas — 
ts oe nit te sg. gro\ Vess AUR SAY UT Sew. , 
, rn nee ESSRETE os REE ene ae ee ok Tag SE Ee Rae ae 
5 we 
: 
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Display 3.2 Sums of Squares Computations for Contrasts 
in the Meat Storage Experiment 


J, =748 J =5.50 Js =7.26 Jy = 3.36 


ky ky kz kg 


Ci —1 =) +1 


2 
Co: 0 2. =f eS 
0 


Cs: 0 l —] 


SSO, = 3G AD — 5.50 — 726 — 3.367713? + ( — 1) 
= 3(6.32)*/12 = 9.99 

S9C> = 3(2(5.50) — 7.26 — 3.36)7/[2° + (— 1)? +(— 1 
= 3(0.38)?/6 = 0.07 

SSC3 = 3[7.26 — 3.36]2/[12 +(— 1)?] 
= 3(3.9)?/2 = 22.82 


The two forms of the contrasts stated for the null hypotheses are equivalent 
except the coefficients of the latter differ from the former by a multiple of 2. 
The choice is purely a matter of personal preference. 

The null hypotheses for the three contrasts in the meat storage experiment are 


Ao: Cy = 341 — He — U3 — b4 = O 
Ho: C2 = 2p2 — 3 — Ha = O 
Ao: C3 = 3 — ps = 0 


Testing Contrasts with the F Test 


‘The alternate hypotheses are nonzero differences, or Hz: C #0. The null hy- 
pothesis, Hy: C = 0, is tested with 


VETS (3.9) 


where MSC = SSC is the 1 degree of freedom mean square for the contrast. 
Under the null hypothesis Fp has the F' distribution with 1 and (N — t) degrees of 
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freedom. The null hypothesis is rejected at the a level of significance if 
fy > Fa iN-2- 

The Fo test statistic for each of the contrasts is given in the column labeled “F”’ 
in Table 3.2. The column labeled “Pr > F’” shows the probability of exceeding the 
observed Fp statistic. The critical value for Fp at the a = .05 level of significance is 
F’'o5,1,8 = 5.32. The null hypothesis is rejected for contrasts C, and C3, Pr > F = 
000, and not rejected for contrast Cp, Pr > F = .453. 

Since the contrast C’, has a positive value estimate, c; = 2.11, we can conclude 
that the mean number of bacteria on the meat wrapped in the conventional commer- 
cial wrap (y,, = 7.48) was greater than the average over the meats wrapped in the 
artificial atmospheres, ;(J, + 3, + Ys.) = +(5.50 + 7.26 + 3.36) = 5.37. The con- 
trast C’3 has a positive value estimate, cz = 3.90, and we can conclude that there are 
less bacteria on the meat in the CO, (Y, = 3.36) than on the meat in the mixed gas 
atmosphere (y,; = 7.26). 


Testing Contrasts with the Student t Test 


Tests of the treatment contrasts can be conducted with the Student t test as well 
as with the F' test. The relationship between the Student t distribution and the F 
seocnaepainsie is 


Pr=F (3.10) 


where the Student ¢ has v degrees of freedom and the F statistic has 1 numerator 
degree of freedom and v denominator degrees of freedom. The relationship is al- 
ways valid when there is 1 degree of freedom in the numerator of the F statistic. 
The degrees of freedom for the Student t will be the same as that for the denomina- 
tor of the F' statistic. 

For any treatment contrast estimate c, with standard error s,, the ratio 


Confidence Intervals or P-Values? 


The P-value conveniently summarizes with a single value the significance or non- 
significance of hypotheses tests about the contrasts. However, the P-value conveys 


80 CHAPTER 3 TREATMENT COMPARISONS 


no quantitative information about the contrast. A small P-value, indicating 
significance, may or may not indicate a contrast is meaningfully different from the 
hypothesized value. 

‘onfidence intervals, conversely, provide meaningful quantitative information 
about the contrasts as well as their status with respect to the hypotheses about their 
values. The results from the Amiodarone study in Example 2.2 can be used to illus- 
trate the benefits of confidence intervals. 

Recall the estimated increase in the Amiodarone-treated rabbit ears was 
1.20°C, a clinically significant value, while those estimates for the vehicle and sa- 
line solutions were 0.13°C and 0.00°C, respectively, which were not clinically 
significant. 

If Amiodarone increased the temperature more than the vehicle solution in 
which it was carried, then Amiodarone would be seen as contributing to tissue in- 
flammation. Likewise, the comparison of the vehicle solution with the saline solu- 
tion would convey similar information about the contribution of the vehicle 
solution to tissue inflammation. 

The following table shows the two contrasts' estimates along with their stan- 
dard errors, 95% confidence intervals, and P-values for the Student ¢ test or Ff’ test. 


Contrast Case « 26 Sc (L, U) P-value 

Amiodarone—-Vehicle A 1:07 e@25 (0.55, 1.59)  .0036 
Bi 23 2)i1@75 (1.65,4.77) .0036 

Vehicle—Saline A 0.13 0.25 (—0.39,0.65) .6088 
B  0.39750:;75— (=4-977995). 6088 


The 95% confidence interval estimate for Amiodarone versus Vehicle in Case 
A indicates Amiodarone significantly elevated the ear temperature over the Vehicle 
solution with a positive lower limit and an upper limit of 1.59°C, which is 
Case B shows a threefold increase in the contrast and its standard error 
with an attendant threefold increase in the confidence interval upper limit to 
4.77°C, which is dramatically higher than that for Case A. 


The 95% confidence interval estimate for Vehicle versus Saline in Case A indi- 
cates no difference between the two solutions, with the limits including 0 and both 
limits less than clinically significant. Case B again shows the artificial threefold in- 
crease in the contrast estimate, standard error, and confidence interval limits. How- 
ever, in this latter case the limits exceed clinical significance. The P-value of .6088 
in both cases indicates no significant difference from 0 for the contrast and indi- 
cates no more than that. Case A did not approach clinical significance, but it is evi- 
dent that the limits for the interval estimate of Case B did exceed clinical 
significance. Thus, with Case B the investigator may want to consider further 
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action to achieve more precise estimates for the comparison of Vehicle and Saline 
treatments. 


Orthogonal Contrasts Convey Independent Information 


A certain class of contrasts, known as orthogonal contrasts, has special properties 
with respect to the sum of squares partitioning in the analysis of variance and with 
respect to their relationship to one another. Orthogonality implies that one contrast 
conveys no information about the other. Contrast cz, a comparison between the 
vacuum packaging and the two gas atmospheres, conveys no information about 
contrast c3, a comparison between the two gas atmospheres. 

Suppose there are two contrasts c and d, where 


t 
C=) ki, = ky, + koe, +++: + bed, 
es | 


and 


t 
d= J dG, = diy, + eGo, +++ + dG, 
i=1 


The contrasts c and d are orthogonal if 
ASSET LSA GLEE Ie a CEE STEER NPAC ES ELE ETE EE 


Nile ND aeRO gees > ene, (3.12) 
r% ry "2 Tt 
The weighted sum of crossproducts ofthe coefficients k; and d; must sum to 0. If 
all r; are equal, then c and d are orthogonal if i k;d; = 0. 

There are (t — 1) mutually orthogonal contrasts among t treatment means; each 
pair of contrasts will be an orthogonal pair. For example, there are three mutually 
orthogonal contrasts possible in one set of contrasts among four treatment means. _ 


The coefficients for two of the contrasts of the meat storage experiment, cz and 
c3, With equal treatment replications ai are ELL in cnaies 3. 3 os Si = hese: cross- 
a of _ iiss deat ( asts are , ; 


aha 
Se 


Display 3.3. Orthogonal Contrasts for the Meat Storage Experiment 


Coefficients 


_ Contrast hye te Ro k3 ka 
C9 0 2 —] — 1 
C3 0 0 | — | 


Sum of crossproducts = (0)(0) + (2)(0) + (— D0) + (— DC- 1) = 0 


82 


CHAPTER 3 TREATMENT COMPARISONS 


Another special feature of orthogonal contrasts is that the sums of squares for 
the (t — 1) orthogonal contrasts sum to the treatment sum of squares in the ana ysis 
of variance. The sums of squares for the three orthogonal contrasts of the example 
(cj, C2, and c3) sum to the treatment sum of squares within rounding errors, 
SST = 32.87 as seen in Table 3.2. 


Contrasts Among Treatments with Unequal Replication 


The Amiodarone experiment in Example 2.2 had unequal replication of the 
treatment groups. The summary statistics for the experiment are shown in Display 
3.4. 


Display 3.4 Summary Statistics from the Amiodarone 
Experiment of Example 2.2 


Treatment 


Amiodarone Vehicle Saline 
i, = 9; 1.20 013 0800 
V7 9 6 8 
t=3 N=23 s? = MSE =0.2177with 20 degrees of freedom 


- Two comparisons of interest may be (1) the contrast of the Amiodarone treat- 
ment mean with the vehicle and saline means and (2) the contrast between the vehi- 
cle and saline means. 


and 


The contrast coefficients are (2, — 1, — 1) and (0,1, — 1) with replication 
numbers (9, 6, 8). Evaluating the contrasts using Equation (3.12), 


A) ri a aa ewe), 


£0 


9 6 8 


For the contrasts to be orthogonal the values of the coefficients would have to 
be altered. 


Cy = TY;, — 392, — 493, 


and 
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The coefficients (7, — 3, — 4) and (0,1, — 1) satisfy the condition of a con- 
trast with 7 k; = 0. Evaluation by Equation (3.12) is 


WO) ovimithh):, cor demddes 
> are ey 


However, consider the first contrast, c) = 7y, — 3Y. — 4y,.. The coefficients 
imply a weighted comparison among the population means. Unless the treatment 
populations occur in the proportions 7: 3:4 the comparison does not make sense. 
There is unequal information in each of the treatment groups within the experiment; 
however, it is not necessarily so that the treatment populations would be unequally 
represented in nature. In the case of the Amiodarone experiment the contrast with 
equal weights for the treatment means is a more sensible comparison. 


Comments on Orthogonality 

To present a nicely ordered analysis of variance table, the choice of contrasts 
for a study must not be dictated by their orthogonality. Rather, the contrasts should 
be constructed to answer specific research questions. For example, it may be of 
interest in the meat storage experiment to contrast the control treatment separately 
with each of the other treatments, resulting in a non-orthogonal set of contrasts, 
[1 — fe, -1 — M3, and jy, — p4. Orthogonal contrasts purposely were chosen for the 
meat storage experiment to simplify the presentation. However, they did in the 
meantime answer some specific meaningful questions, which is after all the pur- 
pose of a research study. The research hypotheses and the treatment design should 
dictate the construction of the contrasts. 


3.3 Response Curves for Quantitative Treatment Factors 


Many studies are conducted to determine the quantitative trend relationship be- 
tween two variables. In experimental studies, one of the variables is usually under 
the control of the investigator while the other is the observed response variable. 
The factorial treatment designs discussed in Section 1.4 are an example of struc- 
tured treatment designs in which a quantitative factor may have several graded 
levels. 

For example, an experiment can be designed to measure the growth response 
of animals to increasing amounts of nutrient in the diet. The treatment design 
consists of one quantitative factor—the amount of nutrient in the diet with three 
levels, say, 0, 500, and 1000 parts per million (ppm). The objective of the study 
will be to characterize animal growth as a function of the amount of nutrient in the 
diet. 
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Polynomial Response Functions for Response Curves 


The polynomial model often used to describe trend relationships between a mea- 
sured response y and quantitative levels of a factor « is 


ae Bo se Bix =f Box? fos es 45252" +e (3.13) 


Several examples of response curves are shown in Figure 3.1. The linear, quadratic, 
and cubic polynomial response curves provide good approximations to many rela- 
tionships common to the biological and physical sciences. 


Y= Bo + Byx Y= Bo + Bix + Box? Y= Bo + Bix + Box? + Bgx3 
x x x 
(a) linear (b) quadratic (c) cubic 


Figure 3.1 Examples of polynomial response curves: (a) linear, (b) quadratic, and 
(c) cubic 


Example 3.1 Grain Production and Plant Density 


The growth and development of plants, as with any living organism, is 
dependent on the availability of sufficient nutrition. The objective for most 
cultivated grain crops is to maximize seed or grain production under a given 
cultivation and soil fertility regimen. If the grain producer plants too few 
seeds per unit of area, maximum grain production will not be realized simply 
because each plant has genetically determined maximum potential. Thus, in- 
creased grain production requires more plants per unit area. On the other 
hand, for an entirely different reason, maximum production will not be real- 
ized if an excessive number of plants are grown per unit of area. Available 
nutrition per plant becomes limiting, and the plant growth and development 
are reduced with concomitant reduction in grain production. 

The objective for a crop scientist under these circumstances is to charac- 
terize the relationship between plants per unit of area and grain production 
under a given cultivation and fertility regimen. An experiment was conducted 
to estimate a polynomial i curve to —— the relationshi 


50). Each of the five treatments caper nomen raedorisel to dates field plots in | 
a “completely randomized experiment design. The resulting grain yields are 
shown in Table 3.3. 
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Table 3.3. Amount of grain produced per plot for five plant densities - 


, Plant Density (x) 
10 20 30 40 50 


2 16.0 18.6 17.6 18.0 
11.4 15.5 20.2 19.3 16.4 
12.4 16.5 cw 17.1 16.6 
Means (7; ) 12.0 16.0 19.0 18.0 17.0 


ic 5 A ‘atic elat im 0} nsh ip be- 


The mean . responses of grain yields G.) to 


alent eee ays saw | in n Table 3.3 also suggest a quadratic response polynomial 
as an approximate model to describe the biological relationship between ‘grain —_ 


ip. \ Hie five 
however, a simpler 


valtee sig x it a —_ to fit a — a Ooty equation in v; 
and less complex second-degree equation may do just as well. 


Grain 


10 20 30 40 50 


Plants 


Figure 3.2 Grain yield versus plant density 


Simplifying with Orthogonal Polynomials 


The polynomial model can be fit to the observed data with many of the available 
computer regression programs. The trend analysis can be simplified by examining 
orthogonal contrasts among the treatment factor levels that measure the linear, qua- 
dratic, and higher level polynomial effects. These contrasts, known as orthogonal 
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polynomials, enable us to evaluate the importance of each polynomial component 
= a st acne contrast. 


x - into s-ortioeatet polynomials, fa Afr orthogonal polynomial equation 
model for the relationship between plant density and cee yield i is 


(3.14) 


where p is the grand mean and P, is the cth-order orthogonal polynomial for the 
ith level of the treatment factor. 

The transformations for the powers of x into orthogonal polynomials (P;;) up 
to the third degree are shown in Display 3.5. The tabled values of the orthogonal 
polynomials for t = 3 to 10 are given in Appendix Table XI. The values from Dis- 
play 3.5 or Appendix Table XI are valid for any distance between the values of x as 
long as the spacing is equal between all values of x (or factor levels) and replica- 
tion numbers are the same for all treatments. The constant A; at the beginning of 
each transformation makes each of the P; values an integer value. 


Display 3.5 Transformation of the Powers of z into 
Orthogonal Polynomials 


Mean: 


Linear: 


Quadratic: 


3 
; Lr rL—-2 IP 7 
Cubic. Fs =A. ( 7 _ ( 7 a 


t = number of levels of the factor «x = value of the factor level 
7 = mean of the factor levels d = distance between factor levels 


Note the term (x — %)/d occurs consistently in all of the orthogonal polynomi- 
al transformations in Display 3.5. The transformation (x — 2) centers the x values 
around 0, while dividing the result by the increment between the values of x scales 
the values to change by one unit between levels. For example, in Example 3.1 with 
z = 30and d = 10 the resulting transformation is 


=... .....20.3) 1 30 
(ar 0). 0 ue ead ae. 20 
480s. wd icles 2 
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Some of the standard computer program packages for statistical analysis are 
capable of producing the orthogonal polynomials with unequal or equal spacing be- 
tween the x values. Grandage (1958) gave a manual method for deriving the 
orthogonal polynomials with unequal spacings. 

The orthogonal polynomial transformations for the plant densities are wtbieli in 
a f= 5, © = 30, and x = 10, 20, 30, 40 ee 


et 0! 


The ——— mutually 


orthogon 
Table 3.4 Computations for orthogonal polynomial contrasts and sums of squares 


Orthogonal Polynomial Coefficients (P.; ) 


Density (x) Y;. Mean Linear Quadratic Cubic Quartic 
10 12 l —2 p —] ] 
20 16 l —] —] 2 —4 
30 19 1 0 —2 0 6 
40 18 l | — | —2 —4 
50 17 l 2 2 l l 
re _ l ] 5/6 35/12 
Sum = 2 Pay; 82 12 —14 ] 7 
Divisor-== 5). P? 5 10 14 10 70 
S8P= (Pay, yo Pe _ 43.2 42.0 0.3 21 
ae = SP sy, (a Pe 16.4 hz — 1.0 0.1 0.1 


The treatment sum of squares can be partitioned into an additive set_of_1 de- 
gree of iealbni sums of squares, one sum of squares for each of the (t — 1) orthog- 
onal polynomial contrasts. Consequently, it is possible to test sequentially the 
significance of the linear, quadratic, cubic, and so forth, terms in the model to 
determine the best-fitting polynomial equation. | 

The estimates of the ac: coefficients for the orthogonal _——— equation in 
Equation (3.14) and the sum of squares for each of the orthogonal polynomial con- 
trasts are shown in Table 3.4. The estimated orthogonal polynomial equation is 
found by substituting the estimates of p, a1, a2, a3, and a4 from Table 3.4 into 
Equation (3.14). The estimated equation is 


The analysis of variance for this experiment is shown in the top analysis of 
Table 3.5. The ratio Fo = MST'/MSE tests the global null hypothesis Hp: 
Q) = a2 = a3 = a4 = 0. At the .05 level of significance the critical region is 
fo > F'o5,4,10 = 3.48. The null oe is rejected since Fo = 29.28 exceeds the 
critical value. 

The 1 degree of freedom sum of squares partitions for each of the orthogonal 
polynomial contrasts are summarized in the analysis of variance at the bottom of 


ot 
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Table 3.5. Notice the sums of squares for the four contrasts sum to the sum of 
squares for Density, 87.60 with 4 degrees of freedom. 


Table 3.5 Analysis of variance for the orthogonal polynomial model relationship 
between plant density and grain yield 


Source of Degreesof Sumof Mean 

Variation Freedom — Squares Square F eS 
oe es ee 21.90 29.28 000 
Error 10 7.48 0.75 

Contrast DF Contrast SS Mean Square F Pr>f 
Linear oo 43.20 43.20 7.70 .000 
Quadratic 1 42.00 42.00 56.15 .000 
Cubic l 30 30 40 41 
Quartic l 2.10 2.10 2.81 Age 


We are interested in the contribution of the separate polynomial terms of the 
model. One strategy to determine the best polynomial equation is to test the signifi- 
cance of the terms in the sequence: linear, quadratic, cubic, and so forth. Beginning 
with the simplest polynomial, a more complex polynomial is constructed as the data 
requires for adequate description. 

The sequence of hypotheses is Hp: a; = 0, Ho: ag = 0, Ho: a3 = 0, and so 
forth. These hypotheses about the orthogonal polynomial contrasts are each tested 
with the F test for the respective contrasts. The ratios Fy = MSC/MSE are given 
in Table 3.5 for each of the polynomial contrasts estimated for the plant density 
study. For each sum of squares partition the null hypothesis is Hp: ac = 0 with 
critical eon. = > #05 110 = 4.96. 


salsnomskip RE cam yield and oe “density oft ae basis. oF thre fasion 
tests. 


Computing the Response Curve 


The estimated quadratic response curve without the cubic and quartic terms, a3P3 
and a4P4, from Equation (3.15) is 


¥,=9 taPit oP; = 164+ 1.2P,; — 1.0P; (3.16) 


The estimated value for a plant density of 2 = 10 is determined by substituting 
P,; = -2 and P, = 2 into Equation (3.16) as y = 16.4 + 1.2( — 2) — 1.0(2) = 12.0. 
The observed grain yields and those estimated from Equation (3.16) are shown 
for all plant densities in Table 3.6. 
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Table 3.6 Observed grain yields and those estimated from the quadratic 
orthogonal polynomial equation 


Density Coefficients Estimated Observed 
x is P, Yi Vi. 
10 —2 2 12.0 ws 
20 —1 — | 16.2 16 
30 0 —2 18.4 19 
40 | — | 18.6 18 
50 2 2 16.8 17 


The polynomial relationship expressed as a function of y and z in actual units 
of the observed variables is more informative than when expressed in units of the 
orthogonal polynomial. A direct transformation to an equation in x requires the in- 
formation in Display 3.5 and Table 3.4. The necessary quantities are \, = 1, 
A2 = 1,d = 10, = 30, and t = S. Then substituting for the P, 


y¥ = 1644+ 1.2P, — 1.0) 


of g430 
= 164+ 12) | 5 
and simplifying to 
y¥ =5.8+40.722 — 0.012" (3.17) 


The estimated line from Equation (3.17) may be plotted as shown in Figure 
3.3, along with the observed data points (squares) and treatment means 7, (filled 
circles). 


Standard Errors and Confidence Intervals for the Response Curve 


The estimated response curve is composed of estimates for several parameters. The 
quadratic equation chosen as the best-fitting equation for the relationship between 
grain production and plant density has three estimates: y = 16.4, a; = 1.2, and 
Ay = —1.0. 


The estimator for the variance of y¥ = Y¥ +0 ,P, + QP, is 
eo map Pian + Pes, (3.18) 
The variance estimator for a polynomial contrast s? is 


2 s° 


so = (r= P2) P2) (3.19) | 


and the standard error estimator for a contrast is the square root of the variance 
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Grain 


10 20 30 40 50 


Plants 


OT 
Figure 3.3. Estimated response curve, Y =5.8+0.72x — 0.0127, for grain yield 
as a function of plant density 


se = 82 (3.20) 


The variance estimates for the coefficients of the estimated quadratic equation are 
computed using s? = MSE = 0.75 from the analysis of variance in Table 3.5 and 
the divisors from Table 3.4. The variances are 


: 0.75 0.050 
5. SS FU 
: GO). % ye ae 
ZR. 2 D1 4) +] 404 

; 3 0.025 

s= = = VJ, 

oe waa68O) 

and 
st = Pees 0.018 
= ye : 
The variance of an estimated value is 
si = sz +83 PY + 85,P3 (3.21) 


= 05 + 0.025P? + 0.018P; 


Since the values of P; and P, differ for each value of plant density, the vari- 
ance of the estimated value differs for each value of plant density. The variance of 
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the estimated yield for plant density z = 10, using P,) = — 2 and P) = 2 in Equa- 
tion (3.21), is 


s2 = 0.05 + 0.025( — 2)? + 0.018(2)? = 0.222 
and the standard error is 
| Sq =4/0:222-— 0.471 
The 100(1 — a)% confidence interval for the iiamnilted value is computed from 
Y = to/2cn-t)(Sq) 


The standard errors and 95% confidence intervals for the estimated values of grain 
yield for all five plant densities using t.925,19 = 2.228 are shown in Table 3.7. 


Table 3.7 Observed and estimated grain yields and standard errors for estimated 
grain yields for each plant density 


Seed Yield Standard 95% Confidence 
Density Observed Estimated Error Interval 
10 12 12.0 0.471 (10.95, 13.05) 
20 16 16.2 0.305 (15.52, 16.88) 
30 19 18.4 0.349 (17.62, 19.18) 
40 18 18.6 0.305 (17.92, 19.28) 
50 17 16.8 0.471 (15,75, 17.85) 


The estimated response curve (Figure 3.3) has the advantage of portraying the 
relationship between y and x throughout the entire range of x values used in the ex- 
periment. With this example, it is possible to describe or estimate the grain yield for 
any plant density between 10 and 50 plants. The description of the relationship is 
not constrained, in a discussion of the results, to the five plant densities used in the 
study. 


3.4 Multiple Comparisons Affect Error Rates 


The group of contrasts exhibited in Section 3.2 are considered multiple compari- 
sons because more than one comparison was made among the treatment means. 
Any necessary number of these comparisons can be constructed in the analysis to 
help answer the research questions. 

However, multiple comparisons among treatment means can lead the investiga- 
tor, sometimes unwittingly, into a statistical minefield. The abundant number of 
available procedures increases the difficulty on the part of an investigator of choos- 
ing the appropriate method for a particular situation. 
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Error Rates for Multiple Comparisons 


The difficulties with multiple comparisons reside mainly in an understanding of the 
error rates associated with testing multiple hypotheses. Hypothesis tests have risks 
associated with decisions to reject or not reject the hypothesis. For any contrast 
among means the risk associated with declaring the contrast to be real when it is not 
is the risk of a Type l_errgor. The risk of declaring the contrast among population 
means to be equal to zero when it is not is the risk of a Type II error. 

The risks for Type I and Type II errors are inversely related to one another. 
The level of significance chosen for the hypothesis test determines the risk of Type 
I error. Sample size, variance, and size of the contrast for the true population 
means determine the Type II error rate for a given Type I error rate. 

A simple test for the difference between two treatment means is the Student ¢ 
test, with the statistic calculated as 


= itil a i (3.22) 


s2 | -- ~ 
‘7 Tj 
The significance level or probability of Type I error for a single test is a 
comparisonwise error rate, ac. It is the risk we are willing to take on a single 
comparison. | 

There are p(p—1)/2 pairwise comparisons among p treatment means. For 
example, four treatments (A, B, C, D) have 4(3)/2 = 6 possible pairs: (A, B), (A, C), 
(A, D), (B,C), (B,D) and (C, D). If the six pairs of means are tested with the to 
statistic in Equation (3.22), there is the possibility of committing 0, 1, 2, 3, 4, 5, or 
6 Type | errors if the six population means are all equal. 

With the possibility of up to six Type I errors for six tests we can use another 
form of Type I error based on the accumulated risks associated with the family of 
tests under consideration. The family is the set of pairwise comparisons for the 
example in the previous paragraph. The accumulated risks associated with a family 
of comparisons is often called the experimentwise Type I error rate, ag. It is the 
risk of making at least one Type I error among the family of comparisons in the 
experiment. 


Evaluating the Maximum Error Rate 


The experimentwise Type I error rate can be evaluated for a family of indepen- 
dent tests. However, all pairwise tests using Equation (3.22) are not independent 
since the s2 in the denominator of each of the to statistics is the same, and the num- 
erator of each test contains the same means as several of the other to statistics. 

Although the set of tests in the family just described are not independent, the 
upper limit for the value of the experimentwise Type I error rate can be evaluated 
by assuming independent tests. Suppose the null hypotheses are true for each of n 
independent tests. The probability of a Type I error for any single test is ac (the 
comparisonwise rate) with (1 — ac) as the probability of a correct decision. The 
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probability of committing x Type I errors is given by the binomial distribution as 


(3.23) 


for x = 0, 1, 2,... ,n Type I errors. The probability of having no Type I errors is 
P(x = 0) =(1 - ac)" 


The probability of committing at least one Type I error (x = 1, 2, 3,... , n) is 
P(e 2 1) = 1+ PG@=);or 


(3.24) 


- 
‘rate 


per mparisons. It is the “upper ascii of the ceapcimieeioviae Type I « 
0 se ts among a Sat fof treatment means. 


The relationship 
ac =1—(l—ag)!”” (3.25) 


expresses the comparisonwise Type I error rate as a function of the experimentwise 
Type I error rate. 
_ The relationship between ihe p two es Te error rates fora few selected values of 


ts increases. hens ae | test is eed Boi Type I errors are ewicd as 
they should be since only one Type I error can be committed. When n = 5 the risk 
probability of at least one Type I error among the five decisions has risen to 
Qe = .226, and with n = 10 the risk rises to a probability of .401 to commit at least 
one Type I error. 

The last column of Table 3.8 gives an indication of the comparisonwise Type I 
error rate required to maintain an experimentwise Type I error rate ag = .05. For 
example, when five independent tests are conducted and we want to keep the risk 
of committing at least one Type | error as low as | in 20 chances, the compari- 
sonwise error rate for each of the n = 5 tests must be ac = .01 and for n = 10 
tests it must be ac = .005. 


Table 3.8 The relationship between ag and ag for n = 1, 2, 3, 4, 5, or 10 inde- 
pendent tests 


n ag whenac = .05 ac when arp = .05 
l .050 .050 

Z .098 025 

3 143 017 

4 185 013 

5 226 010 
10 401 005 
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Which Experimentwise Error Rate? 


Some confusion exists regarding error rates for simultaneous inference since 
the error rate can be defined relative to the configuration of the population means, 
1, H2,...,Ht- The a4 genoa error rate has often been defined under the 
configuration’ i= by = ae under this configuration the familywise 
error rate is considered eer in the weak sense by Hochberg and Tamhane 
(1987). Equality of means is unlikely to be true under most circumstances. There- 
fore, when inequalities do occur the weak sense experimentwise error rate offers 
poor protection against incorrect decisions. If no constraints are put on the relation- 
ships among the j;, then the familywise error rate is controlled in the strong sense 
(Hochberg & Tamhane, 1987). The latter strong sense definition can be interpreted 
as the probability of at least one incorrect decision over all parameter configura- 
tions. Hsu (1996) also presents a detailed discussion of error rates for simultaneous 
inference. 


35 Simultaneous Statistical Inference 


The discussion in the previous section considered multiple simultaneous inferences 


ig oe OD . Recoriiae rt 

Table -2 8, the comparisonwise error + rate would have to be reduced for the three 
statements to hold simultaneously with a family error rate of ag = .05. If the con- 
trasts were independent, then ac = .017 would provide an appropriate family error 
rate under a null hypothesis of equality among the means in the contrasts. 

An investigator wants to make informed decisions from the observed data with 
a valid statistical method. The choice of method depends on the type of inference 
desired and the strength of inference desired. 

The types of contrasts most often considered by investigators are 


e planned contrasts 

e orthogonal polynomial contrasts 

e multiple comparisons with the best treatment 

e multiple comparisons with the control treatment 

e all pairwise comparisons 
The first two methods were discussed in Sections 3.2 and 3.3. The other three 
methods are considered in later sections in this chapter. 


Strength of inference refers to how much can be said about a comparison. The 
strongest inferences include statements about the direction and magnitude of the 
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difference. Simultaneous confidence interval methods provide the strongest form of 
inference. The investigator is able to assert, with a given level of confidence, in 
which direction the effects of one treatment differ from another and also how far 
removed those effects are from one another. 

Methods providing confident directions inference follow simultaneous confi- 
dence intervals in strength (Hsu, 1996). Th nods, for a given contrast 
C;, assert inequalities (C; > 0 or C; < 0) for each counaek with given levels of 
confidence that all statements are correct but say nothing about magnitude. 

Confident inequalities declare the inequality for each contrast as C; 4 0 with a 
given confidence level that all declarations are correct (Hsu, 1996). These state- 
ments make no assertion about the direction or the magnitude of the inequality. 

The weakest methods are individual comparison methods that do not consider 
the simultaneous nature of the inference. The weakness of these methods was 
demonstrated in the previous section where in Table 3.8 the probability of error for 
simultaneous statements, az, could increase considerably as the number of tests 
increased. 


Multiple Comparison Methods, Probability Tables, and Computer Programs 


Well-defined methods with explicit probability tables exist for some of the multiple 
comparison procedures discussed later in this chapter, given the study can be 
modeled as a completely randomized design with equal replication numbers for 
treatments. 

Either approximations to the probability tables or computer programs must be 
employed if these methods are used with unequal replications or more complex de- 
sign structures. 

Fortunately, a number of statistical program suites include exact computations 
for these methods. However, the documentation for the programs should be 
checked to be certain valid methods are being used. 

Approximations to the probability tables take advantage of several inequalities 
from probability theory. If the special computer routines are not available these ap- 
proximations will provide a conservative alternative. The approximation that will 
be suggested for methods later in this chapter is based on the Bonferroni inequal- 
ity because probability tables based on the inequality are readily available. This 
approximation can also be used for the less structured methods, such as small sets 
of planned contrasts and the orthogonal polynomial contrasts, to provide a measure 
of error protection for simultaneous inference. 


Bonferroni t Statistics for Simultaneous Inference 


In nferroni inequality provides a means to obtain an easy approximation to 
saiUltiDIe ci comparison error rates. The inequality, translated to our context, shows 
that the family error rate is less than or equal to the sum of the individual compari- 
son error rates. When n comparisons are made at the same comparison error rate, 
ac, the Bonferroni inequality gives the relationship 
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Sti 


Op < nag 


Tabled values of the Student 

lected values of ag are given in Appendix Table V as t,,/2,4,, for two-sided tests 
where k is the number of comparisons and v is the degrees of freedom. Values can 
also be obtained from any computer program that can compute probabilities for the 
Student t distribution for upper-tail probability ag/2k. For example, the Bonferroni 
t for k = 3 two-sided comparisons with v = 8 degrees of freedom and ag = .05 is 
t.o25,3,8 = 3.02. Equivalently, the value can be found by determining the value of 
the Student t for v = 8 degrees of freedom exceeded with probability ag/2k = 

.05/2(3) = .00833. 


Simultaneous Confidence Interval (SCI) Estimates 


Confidence interval estimates for contrasts are interval estimates that hold 
ee with confidence level 100(1 — ag)%. The simultar 
s use the Bonferroni ¢ statistic in place of the ivuatt Sadent t stitiatie: ‘The 


ended 100(1 — apf)% confidence interval is 
cx — kv (Se) (3.26) 


set of stervals that hold sinuleneously with “m4 least 95% 7 eeience geese. 
intervals are shown in Display 3.6. 


Display 3.6 Simultaneous 95% Confidence Intervals for Three 
Contrasts in the Meat Storage Experiment 


Standard 95% SCI 
Contrast Estimate Error a, UY) 
iis + (p2 + p3 + p4) ag 8 0.228 (1.42, 2.80) 
- + (us + p14) 0.19 0.241 ( — 0.54, 0.92) 
3.90 0.278 (3.06, 4.74) 


The ne SC oT intervals ‘ider than those comput indi jidual 95% confidence 
cen Table 3. L. sitters isa tee off bet etween the strength of the confidence 
statement and the interval widths. The joint confidence level is less for the shorter 


intervals in Table 3.1 and greater for the wider SCI intervals shown in Display 3.6. 
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The investigator can make joint statements with a Type I experimentwise error 
rate of .05 that (4) the average bacterial growth in artificial atmospheres is less than 
that on the meat in the commercial wrap since the lower limit of the interval is 
greater than 0; (2) there is no difference between the vacuum wrap and the average 
of the gases with respect to the bacterial growth since the interval includes 0; and 
(3) pure CO, results in fewer bacteria than the mixed gas since the lower limit of 
the interval is greater than 0. 

The three contrasts for the meat storage experiment from Table 3.1, their esti- 
mates, standard errors, and calculated tg statistics are shown in Display 3.7. The 
calculated to statistics for the first and third contrasts exceed the critical value 
t.025,3,8 = 3.02. The ¢ statistics in Display 3.7 exemplify the use of confident in- 
equalities inference if only their significance is evaluated. The magnitudes and 
signs of the contrasts and the ¢ statistics can be used to deduce the direction and 
magnitude of the comparisons, although in a less direct manner than with the confi- 
dence intervals. 


Display 3.7 Bonferroni t Tests for Three Contrasts in the 
Meat Storage Experiment 


Standard 


Contrast Estimate —_ Error to 

C’, (Commercial vs. artifical) 2.11 0.228 9.25 
Cy (Vacuum vs. gases) 0.19 0.241 0.79 
C3 (CO, vs. mixed gas) 3.90 0.278 14.03 


Scheffé’s Test for Simultaneous Inference 


Bonferroni t statistics can be used safely for a small number of pre-planned con- 
trasts with preservation of the proposed experimentwise error. A method for testing 
all possible contrasts or constructing confidence intervals for all possible contrasts 
was proposed by Scheffé (1953). The method provides the prescribed experiment- 
wise error protection for any number of contrasts. Consequently, the method is 
quite conservative and is generally used for unplanned contrasts or contrasts sug- 
gested by the data. The Scheffé test is shown in Display 3.8. 

Simultaneous 100(1 — ag)% confidence intervals for all possible contrasts are 
computed with the Scheffé statistic as 


and there is a (1 — az) probability that all intervals simultaneously include the true 
values of the respective contrasts. , 
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Display 3.8 The Scheffé Test 


t 
Consider any contrast, c = )> k;¥, , among t treatment means with standard error 
i=l 


The null hypothesis for the contrast, Hyp: C = 0, is rejected if 


|c| > S(az) (3.28) 


S(az) is the Scheffé statistic 


S(op) = sey/(t — D)Fasfe-1p (3.29) 


where fy,(t-1), 1S the F’ statistic with (¢ — 1) and v degrees of freedom exceeded 
with probability ag. Also, v is thenumber of degrees of freedom for experimental 
error variance s’, used to estimate the standard error of the contrast, s,. 


3.6 Multiple Comparisons with the Best Treatment 


In some studies, the treatments are not highly structured in their relationship to one 
another and structured contrasts are difficult to identify. On the other hand, treat- 
ments are related to one another because they all are under investigation for their 
effect on the measured response variables, and they address some specific problem 
of interest to the investigator. Some possible examples include testing toxins, 
sources of protein for diets, or mixtures of compounds for an alloy. 

Under these circumstances the investigator may want to “pick the winners.” 
The objective is to select the set of treatments or single treatment (if possible) that 
provides the most desirable result. 
ie multiple comparisons with the best (MCB) procedure from Hsu (1984) 
enables the investigator to select treatments into a subset such that the “best” popu- 
lation is included in the subset with a given level of confidence. The parameters of 
interest are 


yi — max p;, fori = 1,2,...,t 


j#i 
where max j2; is the maximum treatment mean not including j;. If y; — max yw; > 0 
je i rE | 
then eatin i is the best. On the other hand, if 4; — max p; < 0, then: treatment 2 


j#i 
is not the best. 
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MCB simultaneous confidence intervals (SCI) for 4; — max y; are constrained 


j#Fi 
to include 0 with the view that no two treatments ever have identical long run aver- 
ages (Hsu, 1996). The constrained MCB confidence interval asserts treatment 7 is 
one of the best if the interval for jz; — max yw; includes 0 or has a lower bound of 
aa jFt 
0. Conversely, if the upper interval bound for yw; — max ps; is 0, then treatment 7 is 


jFt 
not one of the best. The MCB procedure is described in Display 3.9. 


Display 3.9 Multiple Comparisons with the Best Procedure 


Calculate the difference, D;, between each treatment mean, ¥y,,and the largest 


treatment mean of the remaining treatments, max (7;), as 
jFi 


D; = 9; — max (y;), fort = 1,2,...,¢ (3.30) 
jFi 
and the quantity / 


2s? 
M = daxtw\{ — (3.31) 


where d,;,,, is the tabled statistic for one-sided comparisons in Appendix Table VI 


for an experimental error rate of a., k = t — 1 comparisons, and v degrees of free- 
dom for the experimental variance, s* = MSE. 


100(1 — a)% Simultaneous Constrained Confidence Intervals 
The lower confidence bound for jz; — max (¥;,) is 
i Pt 
Tee D,-M_ if(D;-M)<0 
ae otherwise 
and the upper confidence bound for jz; — max (Y,) is 
jFt 


D,+M if(D;+M)>0 
U = 
0 otherwise 


Example 3.2. Flow Rates Through Filters 


The MCB procedure is illustrated using an experiment conducted to evaluate 
filters with different filtering configurations. The filters were all designed to 
screen particles above a certain size. The investigators wanted to know 
which, if any, of the filters allowed the highest flow rate of a particle slurry 
under constant pressure. 
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They had developed 6 filter configurations for testing purposes. Four 
replicate filters of each configuration were constructed for the experiment. 
The 24 filters were tested in random order for a completely randomized ex- 
periment design. The average flow rates for the 6 filter types were 


Filter: A B C D E F 
Mean: 8.29 ded 7.54 8.10 8.59 7.10 


The estimate of experimental error variance for the experiment was 
MSE = 0.08 with 18 degrees of freedom. 


MCB Simultaneous Confidence Intervals 


We can calculate the 95% confidence interval for a comparison of filter B with the 
best of the other filters to illustrate the procedure. The mean for filter B is 
Y. = 7.23, and filter E has the largest mean among all the remaining filters, so that 
max (9;) = Ys = 8.59. Then Dy = 7.23 — 8.59 = — 1.36. 

i ow 

= The value for dy; in Equation (3.31) is found from Appendix Table VI with 
k = 5, ag = .05, and v = 18 degrees of freedom for MSE = 0.08. The appropri- 
ate value is do5,5,18 = 2.41, so that with r = 4 replications and MSE = 0.08 


M =2.A41 = 0.48 


2(0.08) 
4 


The required quantities are D2 - M = — 1.36—0.48 = — 1.84 and D2. + M = 
— 1.36+ .48 = — 0.88. Using the rules for upper and lower limits in Display 3.9, 
L = — 1.84 because Dy — M < 0 and U = 0 because D2 + M is not greater than 
0. The upper and lower bounds are shown for each of the comparisons in Table 3.9. 

Four of the filters (B, C, D, and F) have upper bounds of 0 and thus are not the 
“best” filters. Of the remaining two filters (A and E) neither is clearly the best, 
since both intervals include 0, which in turn implies that each is one of the best fil- 
ters with 95% confidence. In addition, their lower bounds, — 0.78 for A and 

— 0.18 for E, are also close to 0 relative to the lower bounds for the other filters. A 

lower bound close to 0 indicates the treatment is close to the best (Hsu, 1996). Note 
the SCI not only provide the means to identify the best treatment(s) but also give 
information about how far removed each of the treatments is from the best. Based 
on the lower bound of the intervals in Table 3.9 it is easy to see that filters B, C, 
and F in particular are the most removed from the best treatment. 

If a treatment is clearly the only best, then the lower bound will be 0. To illus- 
trate suppose filter E had a mean of 7; = 9.00 rather than 8.59. The next largest 
mean is filter A with y, = 8.29. The constrained confidence interval comparing fil- 
ter E with filter A would be 
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Table 3.9 MCB procedure with flow rate means of six filter types 


; | 95% SCI 
Filter 4; max (¥;) D; D;-M D,+M il, 0); — Seleer?* 
jFt 
A 8.29 8.59 —0.30 —0.78 0.18 (—0.78, 0.18) Yes 
B Lao 8.59 —136 —184 —0.88 (— 1.84, 0) No 
sé 7.54 8.59 A ee i OL ot = 153,0) No 
D 8.10 8.59 -—0.49 —0.97. .-—0.01 .(-—0.97,0) No 
i 8.59 8.29 0.30 —0.18 0.78 (—0.18, 0.78) Yes 
F 7.10 8.59 AO ed OF. oe LOL-i-6— 1.97, 0) No 


Select as “best” when D; + M > 0. 


Ds — M = 9.00 — 8.29 — 0.48 = 0.23 
Ds + M = 9.00 — 8.29 + 0.48 = 1.19 


so that with D; — M >0 and Ds; + M > 0 the constrained interval bounds are 
(O, 1.19), and the interval with a lower bound of 0 indicates filter E is the best. 
Likewise, filter A would not be one of the best under these circumstances since 


D, — M = 8.29 — 9.00 — 0.48 = — 1.19 
D,+ M =8.29 — 9.00 + 0.48 = — 0.23 


so that with D,; — M <0 and D, + M < 0 the constrained interval bounds are 
(— 1.19, 0). Thus, with an upper bound of 0 filter A would not be the best treat- 
ment or among the set of best treatments. 


Selecting the Subset with the Largest Mean 


If the only interest is which treatment or treatments constitute the “‘best” without re- 
gard to how much their effects may differ from the others, then a simple selection 
rule can be used (Hsu, 1984). The MCB rule selects a treatment into the best subset 
with a probability of correct selection, P(CS) = 1 — a, if 


DAeM SO! (3.32) 


The MCB procedure to select the best filter types with P(CS) = 0.95 uses the 
D;+ M column information given in Table 3.9. The treatments in the best subset 
are those for which D;+M>0. Only filters A and E have values of 
D;+ M > 0,0.18 and 0.78 respectively. The best subset includes these two filter 
types with a probability P(CS) = 0.95 that the best filter is in the subset A and E. 
The conclusions from the subset selection procedure do not differ from the SCI 
results. Both filter types A and E were selected to the best subset. However, the SCI 
are more informative because they indicated how close to 0 the lower bounds were 
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for each of the filter types, which in turn gives the investigator more information 
regarding flow rate performance for each of the filters relative to that for the filters 
with the highest flow rates. The subset selection procedure limits the information to 
whether a filter is in the best subset and indicates no more than that. 


Multiple Comparisons with the Smallest Mean 


In some studies the treatment with the smallest mean is the “best,” such as in the 
meat storage experiment in which a packaging would be best if it had fewer bacte- 
ria on the surface. Multiple comparisons with the smallest mean can be made with 
simple modifications of the rule for selecting the largest mean in Display 3.9. 

Calculate the difference, D;, between each treatment mean, ¥;, and the smallest 
mean of the remaining treatments, min (7;), as 

j#t 
D; = y; — min (y;) for? = 1,2, ... ,¢ (3.33) 
jFt : 

The 100(1 — a)% simultaneous confidence interval lower (L) and upper (U) 

limits are found as follows: The lower confidence bound for jz; — min py; is 
jFt 
ia if(D; —-M) <0 
b= 
0 otherwise 


and the upper confidence bound for yz; — min yp; is 


jFi 
D,+M if(D;+M)>0 
bi &e | 
(0) otherwise 


The constrained MCB confidence intervals for comparisons with the smallest 
treatment mean are interpreted just opposite of those for comparisons with the larg- 
est treatment mean. If the interval for jz; — min ps; includes 0 or has an upper bound 

t 
of 0, then treatment 7 is the best treatment. canpumate if the Jower bound is 0, then 
treatment 7 is not the best treatment. _ 

Multiple comparisons with the best as the treatment with the smallest mean is 
illustrated with the meat storage study and an objective to select the packaging 
material with the least amount of bacterial growth. The experimental error variance 
for the experiment was MSF = 0.116 with 8 degrees of freedom, and there were 
r = 3 replications. The required statistics for the critical value of M in Equation 


(3.31) are do5,3,g = 2.42 and ./2MSE/r = 0.278, so that 
2MSE 


M = dos3.8 = 2.42(0.278) = 0.67 

The treatment means and 95% simultaneous confidence interval estimates for 
comparisons with the smallest mean are shown in Table 3.10. The procedure is 
illustrated for the commercial wrap treatment with mean ¥Y, = 7.48and the 
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Table 3.10 Selection of the Treatment Subset with the Smallest Means in the 
Meat Storage Experiment 


95% SCI 
Treatment Y; min(y,) D; D,-M D;+M (L, U) Select?* 
jFi 
Commercial 7.48 3.36 4.12 3.45 4.79 (0, 4.79) No > 
Vacuum 5.50 3.30 2.14 1.47 2.81 (0, 2.81) No 
Mixed 126 . 7336 3.90 223 4.57 (0, 4.57) No 


Pure CO» 3.36 5.50 —2.14 —2.81 — 1.47 ( — 2.81, 0) Yes 
* Select as “best” when D; — MZ < 0. 


treatment with the minimum mean, pure COz, so that min (Y;) = ¥, = 3.36 to give 
»F-1 

Dy = 748 — 3:36 = 4:12; - The=-lower® limit “is 7 = since DD, — M = 
4.12 — 0.67 = 3.45 is not less than 0, and the upper limit is U = 4.79 since 
D, + M = 4.12 + 0.67 = 4.79 is greater than 0. With a lower bound of 0 we can 
assert that the commercial wrap is not the best treatment. The best treatment is pure 
COz since it is the only treatment in Table 3.10 with an upper confidence interval 
bound of 0. The upper bounds for all other treatments are considerably removed 
from 0 and clearly cannot be considered close to the best. 


Selecting the Subset with the Smallest Mean 


Again, if the only interest is which treatment or treatments constitute the best with- 
out regard to how much their effects may differ from the others, then a simple 
selection rule can be used (Hsu, 1984). The MCB rule selects a treatment into the 
best subset with a probability of correct selection, P(CS) = 1 — a, if 


D;-M <0 (3.34) 


The MCB procedure to select the best meat packaging with P(CS) = 0.95 uses the 
D; — M column information given in Table 3.10. The treatments in the best subset 
are those for which D; — M < 0.The only treatment mean with D; — M <0 is 
that for pure CO. Pure CO, is selected as the treatment with the lowest bacterial 
growth with a probability of correct selection P(CS) = 


Unequal Replications and Complex Models 


H: 1 (1996) indicates thé it critical values must be computed separately for each com- 
parison with unequal replication numbers in the completely randomized design. 
Some computer programs have incorporated routines to compute the simultaneous 
constrained confidence intervals with unequal replication numbers. No approxima- 
tions based on probability inequalities were given by Hsu (1996). 

If a more complex blocking or treatment design is used and all differences, 
[i — 143, have the same variance—that is, the design is variance balanced—then the 
standard MCB procedure in this section may be used. 
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If the variances for differences, 4; — j1;, are not all the same in more complex 
designs, then Hsu (1996) recommends several approximations based on probability 
inequalities. The approximation based on the Bonferroni inequality uses the Bon- 
ferroni t for k comparisons with v degrees of freedom at the appropriate level of a 
in place of dx. The least squares estimate, 1, — Ls; and its standard error should 
be used; they can be obtained from most computer programs. 


3./ Comparison of All Treatments with a Control 


In many studies one of the treatments acts as a control treatment for some or all of 
the remaining treatments. (Different types of control treatments were discussed in 
Section 1.4.) Determining whether the mean responses for the treatments diffe 

from that for the control treatment is sometimes of interest. Dunnett (1955) intro- 
duced a procedure for this purpose based on an experimentwise error rate. 


Simultaneous Confidence Intervals 


The tabled statistic to compute 100(1 — a)% simultaneous confidence intervals for 
differences between the individual treatment means and the control mean p; — p- 
using the Dunnett procedure is based on the same statistic used for the multiple 
comparisons with the best procedure in Section 3.6. The Dunnett test to compare 
each treatment mean, ¥;, with the control treatment mean 7, is described in Display 
3.10, 

Suppose filter F in Example 3.2 serves as a control. The five 95% SCI com- 
parisons for control versus treatment means are shown in Table 3.11. The mean dif- 
ferences between F and the k = 5 other filters appear in the third column. The 
critical value of the Dunnett statistic for a two-sided test with an error rate of 
ag = .05 is dos,5,18 = 2.76. The standard error of the difference is \/2MSE/r = 
,/2(0.08)/4 = 0.2. The Dunnett criterion is 


LAOS) = 2.702) = US 
An example calculation is illustrated with filter A. The lower limit is 


L=7, D(5, .05) = 8.29 — 7.10 — 0.55 = 0.64 


and the upper limit is 


U =y, —9¥. + DG, .05) = 8.29 — 7.10 + 0.55 = 1.74 


Filter A is superior to the control filter, F, since the lower bound of the interval is 
greater than 0. Based on the interval estimates filters A, D, and E are superior to 
filter F. Also, note that filter E is the most removed from filter F since its lower 
bound is greater than the lower bound of either filter A or D. Filters B and C do not 
differ from the control since the intervals include 0. 
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Display 3.10 The Dunnett Method for a Comparison of All 


Treatments with a Control 


100(1 — a)% Simultaneous Confidence Intervals for 4; — pz, 


The Dunnett criterion to compare k treatments to the control is 


752 : 
Dk, on) = dary | — (3.35) 


Simultaneous two-sided confidence interval estimates for the differences be-| 
tween the individual treatment means and the control means ju; — pl, are 


Vi es a. = D(k, ag) 


(3.36) 


One-sided interval lower bounds if superiority is manifested by a treatment 
mean greater than the control mean are 


Up Ue Dk, QE) (3.37) 


One-sided interval upper bounds if superiority is manifested by a treatment] 
mean /ess than the control mean are 


re oe G ae D(k, aR) 


(3.38) | 


The values of d.,,, for the two-sided or one-sided Dunnett method are found in | 
Appendix Table VI for k treatments, an experimentwise Type I error of ag, and 
y degrees of freedom for the estimate of experimental error variance. 


Table 3.11 Results of the Dunnett test comparing the mean of the control filter, F, 
with that of all other filters 


95% SCI Different from 
Filter Mean ¥,-U- 2S lv; — 9,| control?* 
F 7,=7.10 — e fa ee 
A 8.29 1 ee (0.64, 1.74) L.19 Yes 
B lao 0.13 ( — 0.42, 0.68) 0.13 No 
C 7.54 0.44 (— 0.11, 0.99) 0.44 No 
D 8.10 1.00 (0.45, 1.55) 1.00 Yes 
E 8.59 1.49 (0.94, 2.04) 1.49 Yes 


= 


*If | y; — y, | exceeds D(5, .05) = 0.55, then the filter mean is different from that of filter F (control). 


Testing Hypotheses About py; — yu. 


If the investigator only wants to know if a treatment mean is significantly different 
from the control mean with significance level a, then the following two-sided or 
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one-sided tests can be conducted: For the two-sided alternative with Ho: wi = Mc 
versus Hy: 4; ~ pe, reject the null uypathesis if 


19-9. | > Dk, az) (3.39) 


For the one-sided alternative with Ho: uw; < pw. versus Hy: 4; > pe, reject the null 
hypothesis if 


Gi; — 3.) > Dtk, ag) (3.40) 


For the one-sided alternative with Ho: uw; > uw. versus Hy: pW; < p_; reject the null 
hypothesis if 


a Oe (3.41) 


Results of the test with the two-sided alternatives are shown in Table 3.11. The 
differences for filters A, D, and E exceed D(5,.05) = 0.55, and their flow rates 
differ significantly from that for filter F. The two-sided tests exemplify a confident 
inequalities inference. We only state with error rate .05 that filters A, D, and E 
differ from filter F, and filters B and C do not differ from filter F. One can only de- 
duce indirectly from the size and sign which differences were positive and which 
were the largest as opposed to the SCI, which directly give us information about 
size and direction. 


Unequal Replications and Complex Models 


isu (1996) indicates that critical values must be computed separately for each com- 
parison with unequal replication numbers in the completely randomized design. 
Sore computer programs have incorporated routines to compute the simultaneous 
constrained confidence intervals with unequal replication numbers. An approxima- 
tion based on the Bonferroni inequality can be used that substitutes the Bonferroni t 
for k comparisons with v degrees of freedom at the appropriate level of a in place 
of da. , ky and uses ,/2s7|1/r; + 1/r,] for the standard error of the difference. 

1 more el blocking or treatment design is used and all differences, 
pie ae ee the same variance—that is, the design is variance balanced—then the 
standard procedure in this section may be used. 

If the variances for differences, jz; — pe, are not all the same in more complex 
designs, then Hsu (1996) presents several approximations, some of which are based 
on probability inequalities. The approximation based on the Bonferroni inequality 
uses the Bonferroni ¢ for k comparisons with v degrees of freedom at the appropri- 
ate level of a in place of d..,,. The least squares estimate, ji; — 4, and its standard 
error should be used; they can be obtained from most computer programs. 

Dunnett (1964) provided adjustments to the critical values in Appendix Table 
VI for unequal replication numbers. A conservative upper bound given by Fleiss 
(1986) for values in Appendix Table VI is md.x.,, where 
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ie 


m <1+0.07 (1 gs “| = = aan 


Dunnett has shown the optimal ratio of replication numbers is r/r, where r is 
he common replication number for each treatment and r, is the replication number 
for the control treatment. 


3.8 Pairwise Comparison of All Treatments 


e in ympare each treatment mean with each of the other treatment 
a neans using naiwise comparisons. The parameters of interest are all pairwise dif- 
ferences among the treatment means, py; — 1; for all 1 # 7, resulting in t(t — 1)/2 
comparisons. Most frequently, applications of these methods have an objective to 
detect significant inequalities, w; € ys; for all 2 F 7. 

The indiscriminate use of pairwise comparison procedures in this manner for 
the analysis of experimental results can lead to the tendency to place a reliance on 
statistical significance alone to drive the inferential procedure in data analysis. It is 
therefore possible to lose sight of the research objectives, and the investigator's 
focus may diverge from the pursuit of biological or physical understanding to that 
of statistical significance. 

Te sa investigators would want to make the comparisons with an experi- 

twise error ‘Tate. in the strong sense as they were made with multiple compari- 
saat the best (Section 3.6) and multiple comparisons with the control (Section 
3.7). Several methods for pairwise comparisons will be presented in this section 
with some discussion of their properties. 


The Tukey Method 


A procedure providing an experimentwise rate in the strong sense was developed 
by Tukey (1949a) for pairwise comparison of all treatment means and is used to 
obtain 100(1 — a)% simultaneous confidence intervals. The test has been called by 
various names, including the Honestly Significant Difference. The Tukey method is 
described in Display 3.11. | 
The Tukey method is based on the Studentized range statistic 
fe he: zest) — y (smallest | 
ee [2 
<, 


where 7 (largest) is the largest mean in an ordered group of means in an experiment 
and y(smallest) is the smallest of the means. The difference or range is divided by 
the standard error of a treatment mean, from which the statistic derives the name of 
Studentized range statistic. 
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Display 3.11 The Tukey Method for All Pairwise Comparisons 


For a group of k treatment means compute the Honestly Significant Difference as 


2, 
HSD(k, az) = qa.rv/ ~ (3.44) 


where qa,k,, 1S the Studentized range statistic for a range of k treatment means in| 
an ordered array. Critical values for an experimentwise error rate, ag, and v 
degrees of freedom can be found in Appendix Table VII. | 


10001 — a)™% Simultaneous Confidence Intervals 


Simultaneous two-sided interval estimates for the absolute value of all pairwise 
differences, jz; — ys; for allz < 7 , are 


9; — ¥;| + HSD(K, ag) (3.45) 
100(1 — a)% Confident Inequalities Test 


Two treatment means are declared not equal, ; — uw; 4 0, if 


|¥; — 9; | > HSD(, ag) 


The absolute difference |y; — 7,| is given in Display 3.11 for the confidence in- 
tervals because the location of the two means in the calculated difference, 
Y; — Yj, is arbitrary, with the sign of the difference depending on whether one cal- 
culates 7; — y; or ¥; — Y;. Thus, the absolute difference is equivalent to always 
subtracting the smaller mean from the larger. If the direction of a particular differ- 
ence is necessary, then calculate the interval with the sign of the difference consid- 
ered. The particulars of the study will dictate whether absolute or signed differences 
are best used for specific comparisons. 


Example 3.3. Strength of Welds 


Pairwise comparison tests are illustrated using an experiment conducted to 
compare the strength of welds produced by four different welding techniques. 
Each welding technique was used to weld five pairs of metal plates in a com- 
pletely randomized design. The average strengths for the five welds of each 
technique were 


Technique: A B C D 
Mean: 69 83 75 71 


3.8 PAIRWISE COMPARISON OF ALL TREATMENTS 109 


The estimate of experimental error variance for the experiment was 
MSE = 15 with 16 degrees of freedom. 


Applying the Tukey method to the data of Example 3.3, the Studentized range 
statistic with an experimentwise error rate of ag = .05 is found from Appendix Ta- 
ble VII as go5,4.16 = 4.05, where there are k = 4 treatment means in the ordered 
array and vy = 16 degrees of freedom for MSE =s?. The standard error is 
V MSE/r = ,/15/5 = 1.73. The computed HSD is HSD(4, .05) = 4.05(1.73) = 
7.0. The 95% SCI and results of the confident inequalities test are shown in Table 
3,12. 


Table 3.12 Results of the Tukey method for differences between weld strength 
means for four welding methods 


95% SCI Different 
Comparison ly; — 9 (LE, from 0?* 
A vs. B 14 Cai Yes 
A vs. C 6 (— 1, 13) No 
A vs. D Zz (—5, 9) No 
B vs. C 8 Ci, 13) Yes 
B vs. D 12 (5, 19) Yes 
C vs. D 4 (— 3,11) No 


* The absolute difference exceeds HSD(.05) = 7.0. 


Two treatment means are different with 95% confidence if the 95% SCI inter- 
val does not include 0. Method B is significantly different from all other methods, 
but no other treatments differ from one another. The amount method B differs from 
the other three methods can be assessed by the magnitude of the lower bound. 
Method B differs most from method A and least from method C. 

Inference on the basis of confidence inequalities declares two treatment means 
different if the absolute difference, | 7; — 9, |, exceeds HSD(4, .05) = 7.0. The 
HSD test judges the weld strengths of method B to be different from the weld 
strengths of all other methods, but no other differences are significant. However, 
inference about the direction and magnitude of the inequalities is not possible. 


Unequal Replications and Complex Models with the Tukey Method 


For unequal replications in the completely randomized design, Tukey in 1953 
(see Tukey, 1994) and Kramer (1956) proposed approximate simultaneous confi- 
dence intervals, which substitute 


(3.47) 
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for ,/s?/r in Equation (3.43). The approximation has become known as the Tukey— 
Kramer approximation, which Hayter (1984) showed to be conservative. 
If the variances for differences, 4; — j1;, are not all the same in more complex 


designs, then the least squares estimates, ji; — j;, and their variances, Si.-7 y> are 
Me 
used for an approximation to the exact form. The simultaneous intervals are 
eee -. 
li; — Big] denny] 5 85,14) (3.48) 


If a more complex blocking or treatment design is used and all differences, 
Li; — Lj, have the same variance—that is, the design is variance balanced—then the 
standard procedure in this section may be used, using the least squares estimates, 
ji; — #;, and their common variance of the difference, 5Gi,—A,) in Equation (3.48). 


The strong sense experimentwise error rate will be exact in this case. 


Tests of Homogeneity, 1 = wo = --- = Lt 


Many of the popular pairwise multiple comparison tests have been used with 
experimentwise error rates evaluated under a restricted assumption that all treat- 
ment means are equal, or uw; = 2 = --: = pz, thus providing experimentwise error 
rates in the weak sense. Some of them use a single criterion for declaration of signi- 
ficance. Other tests are referred to as multiple-range tests since they use multiple 
criteria for declaration of significance, where the value of a criterion for one com- 
parison depends upon how far apart the two means are in the ordered array of all 
the treatment means. The Least Significant Difference, LSD(a), is the most com- 
mon test of homogeneity that uses a single criterion. The Student-Newman-Keuls, 
SNK(k,a), is an example of a multiple-range test that is a test of homogeneity. Both 
tests are illustrated below. 


The Least Significant Difference (LSD) 


Each hypothesis Ho: uw; = pu; versus Hy: yw; # 4; can be tested with the Student ¢ 
statistic: 


(3.49) 


When the Type I error probability is set at some value a and the variance s? has v 
degrees of freedom, the null hypothesis is rejected for any observed value of 
| ¥; — y; | such that | to | > ty/o,. The Least Significant Difference, LSD(q), is 
an abbreviated method of conducting all possible pairwise t tests as shown in 
Display 3.12. | 
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Display 3.12 Least Significant Difference for All Pairwise Comparisons 


For any pair of observed treatment means, y;and y;, the Least Significant 
Difference is 


LSD(q@) = to/2,4] 8? E (3.50) 


The null hypothesis Ho: 4; = y4; is rejected if 
|v;- 91 > LSD(e) 


A modification by Fisher (1960) controls the weak sense experimentwise error 

. The LSD is used to test pairwise comparisons only if the null hypothesis is re- 
jected i in the analysis of variance F' test. If the null hypothesis is not rejected on the 
basis of the F' test, then all treatment means are assumed to be the same and no fur- 
ther testing is done. The procedure often is referred to as the protected LSD. 
Carmer and Swanson (1973) provided empirical demonstration that the experi- 
mentwise error rate with the protected LSD was almost the same as the significance 
level of the F' test used as the determinate when a was set at the .05 significance 
level for the LSD. 

Investigations by Finner (1990) and Hayter (1986) showed the LSD test was a 
confident inequalities method if the number of treatments was less than 3, but that 
for t > 3 it was not a confident inequalities method. 

The LSD test is illustrated with the weld data from Example 3.3. The computa- 
tion of the LSD requires the critical value for the Student ¢ test, t 925,16 = 2.12, and 
the standard error of the difference between two treatment means, ,/2MSE/r = 

215) /5 = 2.45. The computed LSD is LSD(05) = 2.12(2.45) = 5.2. 

The null hypothesis Ho: yu; = py; is rejected if 


| 5; — 9, | > LSD(.05) = 5.2 


A convenient method for testing is to form a table of differences with the means 
ordered from smallest to largest as shown in Table 3.13. The differences in the 
table are each computed as a difference between a mean of the column head and a 
mean of lesser value in the leftmost column. An asterisk indicates the differences 
that exceeded LSD(.05) = 5.2. 

The result of all pairwise comparisons in Table 3.13 indicates that the weld 
strength of method B exceeds that of all other methods, and the weld strength of 
method C exceeds that of method A. 


The Student-Newman-Keuls (SNK) Multiple-Range Test 


The SNK test is one of many multiple-range tests. It is based on the Studentized 
range statistic in Equation (3.43), but in contrast to the Tukey method it results in a 
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Table 3.13 Results of the LSD test on differences between the weld strength 
means for four welding methods 


Method 
A a B 
Method Mean 69 71 75 83 
A | 69 - 2 6* 14* 
D ree - 4 S 
CS ib) = 8* 


B 83 _ 
* The difference exceeds LSD(.05) = 5.2. 


nomogeneity test with experimentwise error rates in the weak sense (Hsu, 1996). 
The eiical value for the Studentized range with the SNK test is based on the range 
of the particular pair of means being tested within the entire set of ordered means. 
The test was developed independently by Newman (1939) and Keuls (1952) and is 
categorized as a multiple-range test since two or more ranges among the means are 
used for the test criteria. The SNK test is described in Display 3.13. 


Display 3.13 Student-Newman-Keuls Multiple Range Test 


The SNK criterion is 


: 
SNK(k, az) = da,kv\/ = ik (3.51) 


where qa,k, is the Studentized range statistic, k is the number of means in the 
range, v is the number of degrees of freedom for the estimate of experimental 
error variance s?, and ag is the experimentwise error rate for a range of k 


means. 


For the largest and smallest means in a range of k means, say y; and y,, the null 
hypothesis Ho: 4; = ps; 1s rejected if 


|v; —9; | > SNK(K, ag) 


The test is not conducted if there is a range of means of size greater than k 
containing y; and 7; that is not significant by the SNK criteria. 


The SNK test is presented here to demonstrate the methods for multiple-range 
tests and also because tables of critical values for the Studentized range statistic for 
a constant a are readily available. A multiple-range test that provides a much 
stronger inference with confident inequalities is presented after the SNK test. 
Although the other test also uses the Studentized range statistic it has not become 
popular, probably because its critical values are not as easily accessible as those for 
the SNK test. 
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The treatment means are ordered from smallest to largest 
In} S Vp] Spy So S Hy 


where Yj) 18 the treatment mean with the smallest value and Y) 1s the treatment 
mean with the largest value. The critical value for each pair of means depends on 
the number of means in the range of the particular pair of means under test. 

The SNK test of the means in Example 3.3 with an experimentwise Type I 
error rate of ag = .05 requires three critical values of the Studentized range statis- 
tic from Appendix Table VII, with ag = .05, v = 16, and k = 2, 3, and 4. With 


standard error \/ MSE/r = ,/15/5 = 1.73, the SNK statistic is computed from 
SNK(k, 05) oS q.05,k,16(1.73) fork = 2 3, 4 


The three critical values of the Studentized range statistic and SNK(k, .05) for 
Example 3.3 are 


i 2 3 4 
7.05,k,16: 3.00 -365,-4.05 
SNK(k, .05): $9 9 Gi BTD 


The critical values for the SNK test increase as the number of means in a range 
increases. As the distance between two means in the ordered array increases, a larg- 
er difference between the means is required to declare the means different from one 
another. For the minimum range of k=2 the SNK(2, .05) is equal to the 
LSD(.05) = 5.2, and for the maximum range of k = 4 the SNK(4, .05) is equal to 
the HSD(4, .05) = 7.0. The SNK test is more conservative than the LSD but less 
conservative than the HSD in the required differences for rejection of the null 
hypothesis. The results of the SNK(k, .05) test for Example 3.3 are shown in Table 
3.14. 


Table 3.14 Results of the SNK(k, .05) on differences between the weld strength 
means for four welding methods 


een nn eS SSS sss sss 


Method 
A D C B 
Method Mean 69 71 75 83 k SNK(k, .05) 
D 71 ee i 8 63 
~~ 


C ta = ™-8* — —2-— —5.2 
B 83 | 


"Differences shown are a larger mean minus a smaller mean. The difference exceeds SNK(k, .05). 


The differences between treatment means for the same range, k, are found on a 
diagonal running from the upper left to the lower right in Table 3.14. The SNK test 
commences with a comparison between the minimum and maximum means, k = 4. 
If the maximum difference does not exceed the critical value no further testing is 
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conducted. If the maximum difference is significant the differences among the 
means with range (k — 1) are tested. 

If any pair of means with range (k — 1) are not significant no further testing is 
= for any other pairs of means between that specific pair of means. By 
definition, no subgroup of means contained in a nonsignificant group of means can 
be ssidniiicnt For example, if there is no significant difference between y,) and 
Yj3). then no test should be performed for Yj) versus 7) Or Yj} Versus 73). The test 
proceeds in this manner until there are no further significant ranges. 

The maximum difference, k = 4, is 14 for B versus A. The difference exceeds 

SNK(4, .05) = 7.0, so the test proceeds with comparisons for a range of k = 3, B 
versus D and C versus A. The difference for C versus A does not exceed 
SNK(3, .05) = 6.3. Therefore, no more tests are conducted between pairs of means 
in the range C to A. They are D versus A and C. The difference for B versus D ex- 
ceeds SNK(3, .05) = 6.3, so testing continues within the groups of means between 
B and D. The difference for B versus C exceeds SNK(2, .05) = 5.2 and is the only 
test necessary with a range of k = 2 means. The SNK test judges method B to be 
different from all other methods with no differences among the other methods. 
With unequal replication numbers for the HSD and SNK tests the harmonic 
mean of the replication numbers from all treatment groups, rp, is often used in the 
standard error estimate for all comparisons to simplify the calculations. However, 
Hsu (1996) showed its use led to invalid statistical inference in general by reducing 
the confidence levels considerably. The harmonic mean r;, is 


(3.52) 


Multiple Range Tests for Confident Inequalities 


Several multiple-range tests have been developed that provide confident inequali- 
ties inference, which is a stronger inference than that provided by the SNK test. 
Einot and Gabriel (1975) proposed a choice of a, to test the difference between 
two means with a range of k in the set of ttreatment means. A modification sug- 
gested by a number of authors has led to common use as 


[= =a5) Wk 2. ra 
sms : Se | 
* 2 ifk=t—1,t ai 


for the desired experimentwise error rate ag. The Studentized range statistic for a 
range of k means would be used for the critical value as qo, 4.v- 

The test will provide confident inequalities inference if the critical values are 
nondecreasing with increasing values of k for the range (Hsu, 1996). The test can 
be conducted fairly easily if a computer program is available to compute quantiles 
for the Studentized range statistic. 

The values of a; and critical values of the Studentized range statistic calculated 
from a computer program for the weld strength example are shown below and can 
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be compared with those for the SNK test given earlier. The statistic for a decision 
regarding significance of a difference between two means is EG(k,a;) = 
dox,k,v \/ 8?/r. Recall there were t = 4 treatments, r = 5 replications, and a standard 
error of 1.73 with vy = 16 degrees of freedom for the error mean square. 


k: 2 3 4 

Qk: 025 05 05 

Gactie. 29 3.65 4.05 
EAS, 05)" “60 6.3 7.0 


Notice the values of the Studentized range statistic for k = 3 and 4 are the 
same as those for the SNK test, but the value for k = 2 is somewhat larger because 
of the different choice for ay. The SNK test uses a, = ag for all values of k. The 
significant comparisons using EG(k, a) are the same as those with the SNK test; 
however, the inference strength has increased to that of confident inequalities from 
strict homogeneity (which the SNK test provided). The differences between the two 
tests become more pronounced as the number of treatments increase. 


3.9 Summary Comments on Multiple Comparisons 


Research hypotheses and treatment designs are the engines that drive the methods 
for analysis of observed results from a study. A set of treatments structured to ad- 
dress certain research hypotheses lead naturally to planned comparisons, such as a 
set of orthogonal (or nonorthogonal) contrasts, polynomial regressions, and com- 
parisons of all treatments with a control. 


The MCB Procedure for Screening Studies 


A set of unstructured treatments challenges the investigator to select an appropriate 
protocol for decision making. Such studies include experiments to evaluate sets of 
crop cultivars, industrial products, pesticides, and so forth. The MCB procedure to 
select a subset of treatments with the desired response is the logical choice if the 
objective is to screen for the best products, pesticides, or cultivars. Subsequent 
studies may provide the possibility for more structured research hypotheses and 
treatment designs with opportunities for hypothesis testing. 


Multiple Contrasts Require Decisions About Error Rates 


Multiple contrasts require the investigator to make some decision relevant to error 
rates and power of the tests. Comparisonwise error rates are applicable if individual 
comparisons are the conceptual units of interest. Experimentwise error rates are 
appropriate if a family of comparisons is the conceptual unit of interest, and the 
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‘investigator wants to reduce the chance of too many incorrect decisions in the 
family of tests. 

Probabilities for Type II errors and the power of a test can be determined on an 
experimentwise basis, but they are usually expressed as comparisonwise rates. The 
more liberal comparisonwise error rate will result in a lower Type II error rate and 
a more powerful test given all other conditions constant. The more conservative 
investigator will utilize the strong sense experimentwise error rate for a family of 
comparisons. The family of comparisons as a conceptual unit is important to the 
investigator under these circumstances. A typical family would be the comparison 
of all treatments with a control using the Dunnett procedure. 

Most good experiments are designed to be efficient in the use of existing re- 
sources and time, and it is more efficient to design an experiment that answers 
multiple, related questions. A family of related comparisons will exist within any 
well-planned experiment, and one comparison cannot be considered in total isola- 
tion from all other comparisons; thus, tests based on experimentwise error rates are 
most appropriate. 


Choose a Pairwise Comparison Procedure Consistent with Your Philosophy 


The selection of an appropriate pairwise comparison method is difficult since they 
each have their advantages and disadvantages. The best alternative is to choose a 
test that is consistent with your philosophy and use it consistently for all of your 
pairwise comparison tests. A test with a single critical value for each experiment is 
preferable. Informative discussions on many different pairwise comparison meth- 
ods can be found in Hsu (1996) and in extensive notes on multiple comparisons 
that were written by J. W. Tukey in 1953 (Tukey, 1994). Some discussions of their 
use by Jones (1984), Carmer and Walker (1985), and Saville (1990) may prove 
useful. 

The Tukey method provides the best protection against decision errors, along 
with the strong inference about magnitude and direction of differences with the 
100(1 — a)% SCI. 

Hayter (1990) provided a method for one-sided simultaneous lower confidence 
bounds, 1; — uw; > L for all 2 > 7, which is very useful for directional inference in 
certain types of studies. The bounds are computed as 


sciptiintin nce < oene ese 
Hi — Hj — I \f —— fora. 4 (3.54) 


These bounds will be much sharper than those supplied by two-sided procedures 
such as the Tukey method. Tables of critical values for the statistic g> can be found 
in Hayter and Liu (1996). 


A General Recommendation 


The general recommendation for conducting multiple comparisons among treat- 
ments is to utilize the research hypotheses and treatment design to choose appro- 
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priate rauuleiple comparison procedures with the strength of inference desired for the 

Confidence intervals provide the strongest inference with magnitude and 
direction of eae followed in strength by confident directions and then confi- 
dent inequalities. A confidence interval procedure that provides magnitude and 
direction of inference is recommended if pairwise comparisons are your only re- 
maining alternative. 


EXERCISES FOR CHAPTER 3 


1. Use the data on traffic delay in Exercise 2.1. 


a. 


Conduct an analysis of variance for the data, and estimate the following contrasts and their 
standard errors: 

(i) acontrast between the pretimed and the average of the semi- and fully actuated signals 

(ii) a contrast between the semi- and fully actuated signals 

Compute the sum of squares of each contrast, and show that their sum is equal to the treatment 

sum of squares in the analysis of variance. 

Test the null hypothesis for each contrast, Hp: C = 0, with the Student t test at the .05 level of 

significance. | 

Test the null hypothesis in part (c) with the F' test at the .05 level of significance. 

What is the relationship between the two tests in parts (c) and (d)? 


2. Use the data on serum T3 concentrations from the experiments with chickens in Exercise 2.3. 
Contrasts of interest were the serum T3 concentration differences between successive stages: (1) 
premolt versus fasting, (2) fasting versus 60 grams of bran, (3) 60 grams of bran versus 80 grams 
of bran, and (4) 80 grams of bran versus laying mash. 


a. 


b. 
. 
d 


Estimate each of the contrasts and their standard errors. 

Test the null hypothesis for one of the contrasts with the Student ¢ test. 

Test the null hypothesis for one of the contrasts with the F' test. 

Suppose you were to test the four contrasts each with a comparisonwise error rate of .05. 
Compute the maximum experimentwise error rate for this family of four tests. 


3. Use the data on lettuce yields in Exercise 2.2. 


a. 
b. 


C. 
d. 


Compute the analysis of variance for the data. 

Determine the best polynomial response function that describes the relationship between 
lettuce yield and nitrogen fertilizer at the .05 level of significance. 

See Table 3.7. Construct a similar table for the results of the current problem. 

Plot the observed means along with the estimated equation. 


4. Use the data from Exercise 2.1. 


a. 


b. 


Compute the 95% SCI for multiple comparisons with the best with “best” defined as the signal 
type with the shortest stopped time delay. 

Select the signal type(s) with the shortest stopped time delay with a probability of correct 
selection of .95. 
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5. 


A hospital clinical laboratory measures the concentration of cholesterol in patient serum samples 
with a spectrophotometer. On one particular day the laboratory analyzed samples from eight 
patients. Two samples from each patient were prepared for analysis. The data that follow are 
concentrations of cholesterol (mg/d1) . 


Patient Cholesterol (mg/dl) 

l 167.3, 166.7 
186.7, 184.2 
100.0, 107.9 
wA5, 21S 
148.5, 149.5 
ae a Ty Re 
161.5, 159.4 
243.6, 245.5 


on NAN MN & WW LO 


a. Compute the 95% SCI for multiple comparisons with the best with “best” defined as the 
patient with the highest cholesterol level. 

b. Select the subset of patients that contains the patient with the highest ees count with a 
probability of correct selection of .95. 


A set of comparisons of interest on the chicken experiment in Exercise 2.3 was the comparison of 

serum T3 for each of the other states with that for the premolt stage. 

a. Compute the 95% SCI comparisons of other stages with the premolt stage using the Dunnett 
method. 

b. What are your conclusions? 


Sections of tomato plant tissue were grown in tissue cultures with differing amounts and types of 
sugars in an experiment with five replications of four treatments in a completely randomized de- 
sign. The tissue growth of each culture is given in the table below asmm x 10. 


Control 3% Glucose 3% Fructose 3% Sucrose 


45 Fat 28 31 
Be 28 31 ay 
40 30 24 ie 
45 29 28 33 
42 a Zf 34 


a. Compute the 95% SCI comparisons of all treatments with the control treatment using the 
Dunnett method. 
b. What are your conclusions? 


The coefficients shown to you by a colleague for a set of contrasts among treatment means follow. 
He wants you to check them out. 


10. 


11. 


12. 
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Treatment A = D E 
is ] 3 — | — | — | 
oe: | — |] 0 — | | 
C3 — | ] — | ] — | 
C4 0 0 2 — | — | 


a. Does each of the proposed set of coefficients constitute a contrast? Justify your answer. 
b. Are C1 and C2 orthogonal? Justify your answer. 
c. Construct a contrast orthogonal to C4 that is different from the others already shown. 


Use the Scheffé test at the .05 level of significance to test the null hypotheses about the contrasts in 
Exercise 3.1. 


Use the Bonferroni ¢ test at the .05 level of significance to test the null hypothesis about the 
contrasts in Exercise 3.1. 


Use the data on serum T3 in Exercise 2.3. 

a. Conduct all pairwise comparisons with the Tukey method at the .05 significance level. 

b. Conduct all pairwise comparisons with the Least Significant Difference at the .05 significance 

level. 

c. Conduct all pairwise comparisons with the SNK multiple-range test at the .05 significance 
level. | 

How did the results differ among the three tests? 

Explain why the results differed. 

Compute the 95% SCI for all pairwise comparisons with the Tukey method. 

What additional information do you have after computing the 95% SCI? 


mmo A 


The following is a description of an experiment on human work systems. In a human work system, 
such as a factory assembly line, workers are often required to move an object to a specific location 
with their hand. 

The specific purpose of the study was to determine the accuracy with which individuals could 
reach to specific target locations on a horizontal plane (for example, the top of a table) with their 
field of view cut off from the targets. 

Previous research led to the hypothesis that distal movements (movements away from the 
body) are more accurate than proximal movements (movements toward the body). It was also 
hypothesized that movements in the cardinal directions (straight ahead, straight toward the body, 
and lateral) were more accurate than movements in other, non-cardinal, directions. 

Targets were set up on the circumference of a circle with a radius of 10 inches (Figure 3.4). 
The subject was seated so that a movement of the right hand to the 90° target position from the 
starting point was a distal (away from body) movement. A movement of the right hand to the 270° 
target was a proximal (toward body) movement. Movements to the 0° and 180° targets were lateral 
movements. Distal movements to the 45° and 135° targets were non-cardinal movements as were 
proximal movements to 225° and 315°. The 0°, 90°, 180°, and 270° targets were cardinal 
directions. 
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Sixteen right-handed male subjects were trained for the study. In the actual trial the investiga- 
tor randomly called out the target angles to the subject. The subject marked his try at the target, 
which was blocked from his view. The distance from the subject's mark to the target was recorded 
for each target. The average distance to each target location was computed from the observations 
on the 16 subjects. A smaller average value represented greater accuracy. 

Show in a table the coefficients required for a contrast among means for each of the following 
comparisons of accuracy between movement directions. 


$2 7 Distal versus proximal in general 
C2: Cardinal versus non-cardinal 
E> Lateral versus distal 
C4: Lateral versus proximal 
ees Non-cardinal distal versus non-cardinal proximal 
C6: Cardinal distal versus cardinal proximal 
90° 


45° 


I 135° 


Distal 
movement 
reference starting point 


Right hand placed at 
for each try 


i 180° 


Proximal 
movement 


! 


270° 


315° 225° 


Subject seated here 


a 


Figure 3.4 Targets set up on circle with radius of 10 inches 
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3A Appendix: Linear Functions of Random Variables 


A linear function of the random variables y;, yo, --- , y, is defined as 
n 
c= > keys = hy + koyo+ +++ +knyn 
i=1 


If the expected value or mean of y; is E(y;) = ;, then the expected value or mean 
of cis 


=E(c)=E (> hw) = s kiE(y:) = 53 Ki pi 
I 1 iat 


= kypy + koptg +--+ + kn pn 


_ The variance of a linear function c = 0 k;y; is 
Ka ae +2 oe ‘KO; 
1) i) 
For example, if c = y; — yo with k; = 1 and kj = — 1, then 


If the y,’s are independent, then o;; = 0 and 


= 


Therefore, if y,; and y2 are independent in c = y, — yo, the variance of c is 


2 ie 2 
= Oj; + 05. 


O~ 
The Sample Mean 


The mean of a sample of r independent observations from a normal distribution 
with a mean yp and variance o? is a linear function 


oud I I 
G¥=-M+-wt--+-y 
2 . ig 


1 
where k; = —. The expected value of the sample mean is 
r 
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Since the variances of the observations are all the same, 0? = 03 =--- =a? =o”, 
then the variance of the mean is 


Linear Function of Sample Means 


If t samples are independent and r; is the number of observations in the ith sample, 
then a linear function of the sample means 


Cty, Roig + + a, 


has a mean 
Ue = E(c) = ki EG) + ko Go) + °°: + EG) 


= kypry + kopa +--+ + kept 


2 2 2 
=k? (2) +k (2) +. 4h (2) 
ey r2 Tt 


If all sample variances are equal, 0? = 03 =--- = 07 =o”, then 


2 2 k2 
eae (H+ B48) 


and a variance 
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4 Diagnosing Agreement Between 
the Data and the Model 


The analysis of variance can lead to erroneous inferences if certain assumptions re- 
garding the data are not satisfied. Diagnostic methods for detecting faulty assump- 
tions are discussed in Chapter 4 along with data transformations that can be used 
to address the problems. A generalization of the linear model for the analysis is 
suggested as an alternative to data transformations. Also, a graphical method is 
introduced to evaluate how well a model fits the data. 


4.1 Valid Analysis Depends on Valid Assumptions 


The validity of estimates and tests of hypotheses for analyses derived from the lin- 
ear model rests on the merits of several key assumptions. The random experimental 
errors are assumed to be independent, be normally distributed with a mean of zero, 
and have a common variance (c”) for all treatment groups. Any disagreement be- 
tween the data and one or more of these assumptions affects the estimates of the 
treatment means and tests of significance from the analysis of variance. 

Summary discussions on the assumptions for the analysis of variance and ef- 
fects of departures from the assumptions can be found in Eisenhart (1947) and 
Cochran (1947). Ito (1980) summarized research on the validity of analysis of vari- 
ance test procedures under departures from assumptions. 


4.2 The Effects of Departures from Assumptions 


If experimental errors are positively correlated, Cochran (1947) showed that the 
actual precision of the treatment mean is less than the estimated precision. The 
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usual standard error estimate is too small. Conversely, the actual precision is greater 
than the estimated precision if the error correlations are negative. The best insur- 
ance against excessive correlation of the observations is randomization of experi- 
mental units to the treatment groups in experiments and randomly sampling 
populations for observational studies. 

If o2 differs from one group of observations to another the standard errors of 
treatment means will generally be greater than if o* is constant over all observa- 
tions. The stated significance levels for the F and t tests may be larger or smaller 
than the significance level actually realized. Theoretical studies by Box (1954a) 
produced results on the actual significance levels of the F’ test conducted at the .05 
level of significance with unequal group variances for equal and unequal replica- 
tion numbers. For a ratio of 1:3 for smallest:largest group variance the actual signi- 
ficance levels ranged from .056 to .074 for equal replication numbers but ranged 
from .013 to .14 with unequal replication numbers. With a variance ratio of 1:7 the | 
actual significance level was .12 with equal replication numbers. 

The analysis of variance F tests are quite robust against departures from the 
normal distribution. Ito (1980) cited the results of theoretical and empirical studies 
on the effects of nonnormality in which the actual significance levels ranged from 
.03 to .06 for tests conducted at the .05 level of significance. 

Ideal conditions are seldom realized in real studies. Minor departures of the 
data from independence, the assumed normal distribution, and homogeneous vari- 
ances generally will not cause large changes in the efficiency of estimates and 
significance levels of tests. Gross departures, especially excessive heterogeneity of 
variance or some variance heterogeneity with unequal replication numbers, can 
seriously affect statistical inferences. The remainder of the discussion will focus on 
those situations. 


4.3. Residuals Are the Basis of Diagnostic Tools 


The observed residuals form the basis for many of the primary diagnostic tools 
used to check the adequacy of linear model assumptions. The residuals are esti- 
mates of the experimental errors computed as the differences between the 
observations and the estimates of the treatment means, or 


(4.1) 


Examining the magnitude of the residuals and their relationship to other variables is 
recommended as the first step in the diagnostic process. 

Residuals are used to provide visual evaluations of the analysis of variance 
assumptions for ci variances and normal satin 25s of ase 
errors. The homogene e as s evaluated with a plot of of 
uals versus the e m s. At normal “probability plot is us 
evaluate ce norm 3 mption. The techniques are demonstrated er 
observations from a study that do not agree satisfactorily with the linear model 
assumptions. 
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Example 4.1 Hermit Crab Counts in Coastline Habitats 


A marine biologist was interested in the relationship between different coast- 
line habitats and the populations of Hermit crabs inhabiting the site. The 
biologist counted Hermit crabs on 25 transects randomly located in each of 
six different sites of a coastline habitat. Summary statistics for the six sites, 
including the mean square for error, are given in Table 4.1. The data are 
given in Appendix 4A. 

There are 150 residuals to be calculated for the data set summarized in 
Table 4.1. For illustration, the first five residuals for the observations in site 
1, €1; = Y1; — 91, are shown in Display 4.1. 


Table 4.1 Means, standard deviations, and minimum and maximum values for 
Hermit crab counts from transects in six different coastline sites 


Standard 
Site Mean Median Deviation Minimum Maximum 
1 33.80 17 50.39 0 2335 
2 68.72 10 125.35 0 466 
3 50.64 5 107.44 0 407 
4 9.24 ae 17.39 0 82 
5 10.00 2 19.84 0 94 
6 12.64 4 23.01 0 95 


MSE = 5170 with 144 degrees of freedom 
Source: Department of Ecology and Evolutionary Biology, University of Arizona. 


Display 4.1 Observations, Mean, and Residuals for Site 1 from 
Hermit Crab Study 


Transect Vij 
] 


A Probability Plot of the Residuals to Evaluate 
the Normal Distribution Assumption 


A eae wee EMG 
a cage eS 
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that for the standard normal distribution. A normal probability plot arranges the 
residuals in increasing order and plots them against corresponding guantiles! of the 
standard normal distribution. The ith-ordered residual has a cumulative frequency 
of i/N in a sample of size NV. 

The quantile of a corresponding standard normal variable is determined for a 
cumulative proportion? of f; = (¢ —0.5)/N. The values of the five smallest and 
five largest residuals and the corresponding standard normal quantiles for the 
Hermit crab counts are shown in Table 4.2. 


Table 4.2 Ordered residuals, cumulative probability (f;), and corresponding 
standard normal quantiles for the Hermit crab counts 


een eee eee eee eee eee eee rere ———————— 


Normal 

Order Residual es Quantile 
1 — 68.72 0.0033 — 2.713 

— 68.72 0.0100 — 2.326 

3 — 68.72 0.0167 — 2.128 

4 | — 68.72 0.0233 — 1.989 

2 — 67.72 0.0300 — 1.881 
146 199.20 0.9700 1.881 
147 263.36 0.9767 1.989 
148 346.28 0.9833 Z.128 
149 356.36 0.9900 2.320 
150 397.28 0.9967 2.713 


ee _—— 


The quantile of a normal variable is known for a given cumulative probability 
and the value, or equivalently the quantile, of a residual is known for its corre- 
sponding cumulative frequency in the residual data set. The corresponding 
quantiles of the residuals and standard normal distribution are paired and plotted as 
the X and Y values in a standard bivariate plot known as a quantile-quantile plot. 
If the - quantiles of the residuals match the quantiles of the normal variable for the 
same accumulated frequency, they will plot on a straight line. 

The normal probability plot of residuals for the Hermit crab counts is shown in 
Figure 4.1. The straight line in Figure 4.1 passes through the lower and upper quar- 
tiles (25th and 75th percentiles) of the data. The quantiles of residuals paired with 
their corresponding standard normal quantiles do not lie on the straight line 


1 The f quantile, q(f), is a value such that approximately a proportion, f, of the data are less than or 
equal to q(f). 

2 The value of f is designed to avoid a value of f = 1; if not there would be no finite value of the stan- 
dard normal deviate. Any slight modification from hos = i/Nto avoid f = 1 is usually adequate for 
these plots. The value here is used in the S-PLUS statistical programs used for the plots in this section. 
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Figure 4.1 Normal probability plot of residuals from the Hermit crab study 
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indicator of the normal probability plot. It illustrates the probability plot for a distri- 
bution that is skewed to the right relative to the standard normal distribution. The 
values above the line in the upper right-hand corner of the figure are residuals with 
positive values larger than expected from the standard normal distribution. The se- 
_ries of values above the line in the lower left-hand corner are residuals with nega- 
tive values smaller than expected from the standard normal distribution. 


Residual Plots to Evaluate the Homogeneous Variances Assumption 


Plotting the residuals ; against the estimated values of the treatment means provides a 
simple visual evaluation of the equal variances assumption for the treatment 
groups. If t the variability of the observations around the treatment means differs 
from group to group the corresponding set of residuals will reflect the differences 
in variation. A plot of the residuals versus the estimated site means for the Hermit 
crab counts is shown in Figure 4.2. 

The plot reflects the differences in the standard deviations among the sites in 
Table 4.1. The dispersion of the residuals varies considerably across the six sites. 
The variability of the residuals increases with the value of the estimated means. If 
the variances are heterogeneous the plot of residuals versus estimated values often 
has the funnel-shaped appearance shown in Figure 4.2. The asymmetry of the 
residuals around the zero value (dashed line) indicates an asymmetric distribution 
of the observations with a long tail to the right. 
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Figure 4.2 Plot of residuals versus estimated site means for the Hermit crab 
counts 


The spread-location or s-/ plot is another residual plot that can be even more 
revealing of heterogeneous variances. Trends in the s-/ plot can be used to reveal 
relationships that may exist between the treatment group means and treatment 
group variances. 

The square roots of the absolute values of the residuals, \/ e;;|, are used to mea- 
sure spread of the residuals, since the size of the absolute values of the residuals 
will reflect the spread or variation within a treatment group. The square roots 
remove some of the asymmetry in the absolute residuals. The estimated treatment 
group means measure Jocation. The s-/ plot for the Hermit crab study is shown in 
Figure 4.3. 

The medians of the ,/|e;;| for each of the sites are joined by straight lines in 
the s-/ plot and show the increase intheirmagnitude with an increase in the site 
means. This increasing trend in the magnitude of the absolute values of the residuals 
reflects the increase in site variance as the site means increase. 


Statistical Tests for Homogeneous Variances 


Levene (Med) Test 


Many formal statistical tests for homogeneity of variances exist for completely 
randomized designs. Conover, Johnson, and Johnson (1981) compared 56 such 
tests and found one of the best to be the Levene (Med) test. 
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Square root absolute residuals 
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Figure 4.3 Spread-location plot for the residuals from the Hermit crab study 


Let y;; be the jth observation in the 2th treatment group and 7; be the median of 
the ith treatment group. Let z,; = |y;; — Yi| be the absolute value of the difference 
between an observation and the median in the ith treatment group. To test for 
homogeneity of the variances, compute the one-way analysis of variance for z;; and 
form the Fo statistic 


t 


do ri(Zi, — Z,)?/t — 1) 


MS = 
i= aE = (4.2) 
pe re 8, ee 
i=l j=l 
The null hypothesis of homogeneous variances, Hy: 0? = 0% =--- =0?, is 


rejected if Fo > Fy(z-1),w-z). The test statistic in Equation (4.2) is a modification 
of the original test introduced by Levene (1960). The modification suggested by 
Brown and Forsythe (1974) was a substitution of the median, ¥;, for the mean, 7; , 
in the calculation of z;;. The test calculations are illustrated in Table 4.3 with five 
observations from each of the first three sites of the Hermit crab study. The values 
required for the Levene (Med) test and computed from the complete data set are 
MST = 14,229; MSE = 4, 860; and Fy = 14,229/4,860 = 2.93. It can be con- 
cluded that the variances are different among the sites, since the null hypothesis is 
rejected with critical region Fo > F'os5.5,144 = 2.28. 
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Table 4.3 Illustration of the Levene (Med) test for homogeneous variances with 
five observations from each of three sites in the Hermit crab study 


a 
Site Vij OF |aeg — Us| Zi. 
l 0 3 3 
0 3 
22 19 
3 0 
a 14 7.8 
2 415 14 401 
466 452 
6 8 
14 0 
12 2 172.6 
3 0 4 4 
0 4 
4 0 
13 9 
5 ] 3.6 
z= 61.3 
eee oe a es ee A 
t 
SST =)> ri(Z;, — 2.) = 92,896 
i=l 
t Tj 
SSE=) > (aj —2%)° = 216,539 
i=l j=l 
MST 92590 £216,539 
loys 7 peter pe ais wage 
F Max Test 


Several formal statistical tests are valid tests for homogeneity of variances in 
completely randomized designs when sample sizes are equal and the observations 
are normally distributed. One of the simplest to compute is the F' Max test statistic 
(Hartley, 1950). The null hypothesis tested with the F' Max statistic is 
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Ab: ee : | (4.3) 


with the alternative hypothesis that some variances differ. 
The test statistic is computed as the ratio of the largest observed variance to the 
smallest observed variance within the treatment groups, or 


2 
Fomeigeke max(s*) 


min(s?) eo 
where max(s?) and min(s?) are the largest and smallest, respectively, within treat- 
ment group variances. 

The null hypothesis is rejected at the a level of significance if fo Max > Fa 
Max, where F., Max is the value of the F’ Max variable exceeded with probability a 
for t treatment groups and v = (r — 1) degrees of freedom for each s?. Critical val- 
ues for the F' Max statistic are found in Appendix Table VIII. 

The F Maz test ordinarily would not be conducted for the crab study since the 
Hermit crab counts do not have a normal distribution. The test is conducted here 
only for illustration. From Table 4.1 the largest and smallest standard deviations are 
found in sites 2 and 4, respectively. The variances for these groups are st = 
17.39? = 302 and s3 = 125.35? = 15,712. The value of the test statistic is 


5 Hole 


Fo Max = a. | 52.03 


The critical value for a = .05, t = 6, and v = 24 interpolated in Appendix Table 
VIII is Fos Max = 3.24. The null hypothesis of homogeneous variances is rejected. 


4.4 Looking for Outliers with the Residuals 


Extremely large positive or negative values of the residuals will be far removed 
from the straight line indicator of the normal plot or far removed from the other 
values in the upper or lower boundaries of the residuals versus estimated means 
plot. The outlier potentially can affect the statistical inference because it inflates the 
estimate of experimental error variance and influences the estimate of a treatment 
mean. Outliers can be the result of errors in collecting and recording data, of 
mistakes in technique, or of a special combination of treatment and environment. 
Prior to discarding outliers it is wise to investigate the cause to avoid loss of 
valuable information. 


Standardized Residuals 


The standardized residuals are computed as 


ered ae (4.5) 
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The standardized residual (w;;) is useful for making quick checks on the presence 
of outliers. The w;; have approximately a standard normal distribution if the e;; 
have a normal distribution. A residual would be considered an outlier if the stan- 
dardized value fell outside the +3 or + 4 limits since the probability of a standard 
normal value greater than 3 or 4 standard deviations from the mean of 0 is quite 
small. 

The largest standardized residual for the Hermit crab counts from the 150th- 
ordered residual in Table 4.2 is 


The standardized residuals computed from the 148th- and 149th-ordered residuals 
in Table 4.2 are 4.82 = 346.28/,/5170 and 4.96 = 356.36/./5170, respectively . 
The three maximum residuals are more than 4 standard deviations from a mean of 
0. The normal probability plot and the residuals versus estimated values plot indi- 
cate the possibility of other outlying values. The outlying values derive from large 
counts of Hermit crabs. In this case, the biologist would want to ascertain the con- 
ditions on those particular transects that would cause the exceptionally high counts. 

Standardized residuals are used because ordinary residuals for diagnostic pur- 
poses have some disadvantages. Estimated residuals in the same treatment group 
are not independent of one another, and their variances are heterogeneous from 
group to group with unequal replications. The variance of an estimated residual is 
Ce = o*(1 — 1/r;) for the completely randomized design with r; replications for 
the ith treatment. 

The diagnostic plots with unequal replication can be influenced by the hetero- 
geneous variances of the ordinary residuals. The plots are also affected by the cor- 
relations among the residuals regardless of the replication numbers. The 
correlations among the residuals tend to make the residuals exhibit more agreement 
with the normal distribution in the probability plot. See Cook and Weisberg (1982) 
for an extensive coverage of residuals beyond the scope and intent of this book. 


Studentized Residuals 


The heterogeneous variances of the ordinary residuals can be corrected by utilizing 
the studentized residuals. The studentized residual for the completely randomized 
design is the ordinary residual divided by its estimated standard deviation 


C5 = or 
See 


The studentized residuals have a constant variance o2 


g,, = 1, but they are still corre- 
lated. They are recommended in place of ordinary residuals for residual plots from 
studies with unequal replication numbers. 

The residual plots and Levene (Med) test provide good evidence that the as- 


sumptions of homogeneous variances and normal distribution are not appropriate 


~™~ 
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for the Hermit crab data. Some observations are good candidates for outliers. If the 
departure from the assumptions, based on our judgments, is not too great, then the 
consequences of ignoring the lack of compliance are not severe. When there are 
serious departures from the assumptions, as in the Hermit crab study, some deci- 
sion must be made for the next step in the analysis of the study. The topics in the 
next section address some possible solutions. 


4.5 Variance-Stabilizing Transformations 
for Data with Known Distributions 


Transformations are used to change the scale of observations so that they conform 
more closely to the assumptions of the linear model and provide more valid infer- 
ences from the analysis of variance. The probabilities of statistical inference apply 
only to the new scale of measurement; significance levels and averages do not 
apply to the original measurements. When transformations are necessary, a com- 
mon practice is to conduct the analysis and make all inferences on the transformed 
scale but present summary means tables on the original measurement scale. 

Bartlett (1947) summarized many aspects of transformations in the analysis of 
variance. In this section, several transformations are discussed for data that are not 
normally distributed but have a known probability distribution. A transformation 
based on an empirical relationship between the sample means and variances is dis- 
cussed in Section 4.6 for data with unknown distribution. 


- Poisson Distribution 


Observations on counts of plants in quadrats, insects on plants, bacterial colonies 
on plates, blemishes on a surface, and accidents per unit of time may have the Pois- 
son distribution for which the mean is equal to the variance py, = 07. The square 
root transformation is recommended to stabilize the variances for observations 
from the Poisson distribution. The square root transformation xz = JY will have a 
constant variance a? = 0.25 for all values of ju,. If the mean is small, say wy < 3, 


then the transformation x = ,/y + ; is superior to JY for stabilizing the variances 


(Anscombe, 1948). The correction is unnecessary if the counts are all large. 


Binomial Distribution 


Observations on the number of successes in n independent trials follow the bino- 
mial distribution. Examples include proportions of defective items in manufactured 
lots, the proportion of germinated seeds, the proportion of surviving larvae in insect 
studies, and the proportion of flowering plants in a transect. The estimated binomial 
probability is 7 = y/n, where y is the number of successes in n independent trials 
with probability of success 7. The mean and variance of the estimated binomial 
probability are = 7 and o* = 7(1—~7)/n, respectively, and there will be a 
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well-defined relationship between the observed proportions and the variances in the 

observed data. | 
The arcsin or angular transformation is recommended to stabilize the vari- 

ances for observations from the binomial distribution. The arcsin transformation, 


L= sin-1 1/7, has a constant variance, 0? = 1/4n, for all 7 if the angle is ex- 
pressed in radians and o2 = 821/n if the angle is expressed in degrees. If n is 
small, say n < 50, then Anscombe (1948) recommends the substitution of 7° = 


(y+ >) /(n+ >) for 7 = y/n in the transformation. If all the observed proportions 


in the study are between7 = 0.3 and 7 = 0.7, the binomial variance is relatively 
stable and the transformation is probably not necessary. : 


Probits and Logits 


Two other transformations related to the binomial distribution are used most fre- 
quently in biological assays. The probit transformation is the value of the standard 
normal distribution that corresponds to a cumulative probability 7 = y/n. The Jogit 
transformation is the natural logarithm of the ratio 7/(1 — 7) used in biological 
assays and the analysis of survival data. Although both of these transformations 
result in an amenable statistical procedure for their intended purposes the variances 
are not stabilized and other models must be utilized for the analysis.One such 
model is discussed briefly in Section 4.7. Details for the use of these transforma- 
tions may be found in Cox (1970), Finney (1978), McCullagh and Nelder (1989), 
and Collett (1991). 


Negative Binomial 


Increases in the number of individuals counted can be related to the number of indi- 
viduals already present; the Poisson distribution then is no longer applicable to the 
problem. The counted individuals tend to occur in clusters in a contagious distribu- 
tion. Animals infected with the same disease organism, plants of the same species, 
or insects of the same species often occur in clusters as a result of the biological 
mechanisms for reproduction or disease transmission. A probability distribution 
frequently used for these data is the negative binomial with a mean pL, and variance 
o? = jy + 72; the variance increases with the mean at a rate greater than with 
the Poisson distribution. A suggested transformation for stabilizing the variance is 
the inverse hyperbolic sine transformation 7 = A~*sinh~* ./y. The variance of the 
transformed observations is o2 =0.25. The transformation requires some 
knowledge of . A substitute transformation, xz = log(y + 1), has an approximate 
linear relationship with the sinh~! transformation (Bartlett, 1947). 
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4.6 Power Transformations to Stabilize Variances 


The distribution of the observations cannot always be determined on the basis of 
sampling properties for the random variable. Under these circumstances, a transfor- 
mation can be determined on the basis of an empirical relationship between the 
standard deviation and the mean. 


Empirical Data Transformation 


The power transformation alters the symmetry or asymmetry of the frequency dis- 
tribution of the observations. The transformations are based on work by Box and 
Cox (1964) in which the standard deviation of y is supposed proportional to some 
power of the mean, or 


Tir uP (4.6) 
A power transformation of the observations 
n= y? (4.7) 
results in a standard deviation to mean proportional relationship 
esl eco Po (4.8) 


If p = 1 — @, then the standard deviation of the transformed variable x will be con- 
stant since p+ 3 — 1 = Oando, « p° in Equation (4.8). 

The transformations are frequently represented as a ladder of powers, a phrase 
originating in exploratory data analysis (Tukey, 1977; Velleman & Hoaglin, 1981). 
Display 4.2 shows the order of the ladder of powers for some of the more useful 
transformations. 


Display 4.2 Transformations in the Ladder of Powers, z = y? 


p yP Name Remarks 

2 y" Square Highest usually used 

1 y! Raw data No transformation 

3 JY Square root Poisson distribution 

0 log(y) Logarithm Holds “0” place in ladder 
- 5 17 JY Reciprocal Minus sign preserves order 


square root of observations 
Reciprocal Reexpress time to rate 


| 
pomened 


Values of p less than 1 will pull in the stretched-out upper end of the observa- 
tions and stretch out the bunched-in lower end of the observations in a distribution 
skewed to the right. Conversely, if p is greater than 1 a left-skewed distribution 
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becomes more symmetric by pulling in the stretched-out lower observations. The 
log transformation is placed at the “O” position in the ladder because its effect on 
the observations falls naturally in that position. 

The reciprocal transformation with p = —1 can be useful in studies that require 
measurement of time to the occurrence of an event. The reciprocal of time can be 
roughly viewed as the rate at which a subject of the investigation arrived at the 
event. It is tempting to assign a value of 0 to subjects for which the event never 
occurs; however, care must be taken since the event was never observed. The 
observation may better be treated as either a member of a truncated set of observa- 
tions or missing observations, depending on the circumstances. 


An Empirical Estimate for the Power Transformation 


An empirical estimate of p can be determined if estimates are available for the 


mean and standard deviation of the treatment groups. Express the relationship be- 


tween the standard deviation and the mean of the zth treatment group as o; = op? 


with a as a constant of proportionality. Take the logarithm of the expression to 
obtain 


log(a;) = log(a) + 6 log(u) (4.9) 


A plot of log(a;) versus log(y;) is a straight line with intercept log(a) and slope /. 
Estimates of the means and standard deviations can be substituted for o; and p;, 
and the estimate of 3 can be obtained from a simple linear regression analysis. The 


value of p for a variance-stabilizing transformation can be taken as p = 1 — 8, 


where B is the estimated slope for Equation (4.9). The determination of an empiri- 
cal power transformation and the effects of the transformation are illustrated with 
the Hermit crab data of Example 4.1. 

A plot of the logarithms of the six site standard deviations against the loga- 
rithms of the six site means from Table 4.1 is shown in Figure 4.4. The estimate of 


G8 from the regression results shown in the plot is B = 0.99. The estimated empiri- 
cal value for p is p = 1 — 0.99 = 0.01. The value is very close to the zero position 
in the ladder of powers, implying a logarithmic transformation for the Hermit crab 
data. 

When there are some zero counts among the observations, a small constant c, 
such that 0 < c < 1, is added to the observed count y to avoid evaluation of a 
logarithm for 0. The values of c = t or | are frequently used; a value of c = ; is 
suggested by Mosteller and Tukey (1977). 


Log of the standard deviation 
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y = 0.66 + 0.99x 
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Figure 4.4 Plot and regression estimates of log(s;) versus log(y; ) from the six 
sites for the Hermit crab data 


The Hermit crab data were transformed as x = log(y + *) because there were 
counts of zero Hermit crabs in some transects. Summary statistics for the 
transformed data are shown in Table 4.4. The normal probability plot of the 
residuals is shown in Figure 4.5; the plot of the residuals versus the estimated 
treatment means is shown in Figure 4.6; and the spread-location plot is shown in 


Figure 4.7. 


Table 4.4 Means, medians, standard deviations, minimum and maximum values, 
and maximum standardized residuals values, w,;, for each site after the 
transformation, z = log(y + 2), for the Hermit crab data 


Standard 
Site Mean Median Deviation Minimum Maximum 
os 0.94 1.24 0.99 — 0.78 237 
Z 1.00 1.01 1.06 — 0.78 26) 
3 0.76 0.71 1.05 — 0.78 2.61 
4 0.39 0.34 0.81 — 0.78 1.92 
5 0.44 0.34 a — 0.78 1.97 
6 0.36 0.62 0.96 — 0.78 1.98 


MSE = 0.8888 


Wij 
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Figure 4.5 Normal probability plot of the residuals after transformation 2 = 
log(y + 2) for the Hermit crab data 


Residuals 
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Figure 4.6 Plot ofresiduals versus estimated treatment means after transformation, 


x = log(y+ = for the Hermit crab data 
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No outliers are evident after transformation since all of the maximum values 
for the standardized residuals are less than 3 standard deviations from the mean 
(Table 4.4). Several values diverge from the straight line of the normal probability 
plot in Figure 4.5, but the appearance is much improved over that of the same plot 
for the original data in Figure 4.1. 

The dispersion of residuals in Figure 4.6 is quite similar for each of the sites, 
and the spread-location plot in Figure 4.7 shows no monotonic increase or decrease 
in the ,/{e;;|, indicating relatively homogeneous variances. The maximum and 
minimum site variances from sites 2 and 5 are s3 = 1.06? = 1.1236 and s? = 
0.767 = 0.5776, respectively (Table 4.4). The computed statistics for the Levene 
(Med) test were MST = 1.44 and MSE = 1.91 with Fo = 0.75. The null hypoth- 
esis of homogeneous variances with a critical value of F'o55144 = 2.28 was not 
rejected. The assumptions required for the analysis with the linear model appear to 
hold sufficiently well for the observations after transformation. 
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Figure 4.7 Spread-location plot for the residuals after transformation, z = 
log(y + >; for the Hermit crab data 


The Box—Cox Power Transformation for Other Designs 


The regression r method used in thi ion to estimate p for the power transforma- 
tion c = y? is only cheer retor omitletely Tandonied designs in which group 
means and group standard deviations can be estimated. The estimation of p with 
more complex experiment designs with blocking that are encountered in later chap- 
ters requires a somewhat more rigorous approach. 
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The original Box—Cox transformation is x = (y? — 1)/p for p41 and c= 
log.y for p = 0 (Box & Cox, 1964). The estimate of p can be found by maximizing 


where MS E(p) is the mean square for error from the analysis of variance using the 
transformation x = (y? — 1)/p for the given choice of p. 

A solution can be obtained by determining MSE(p) for a range of chosen 
values of p, plotting MS E(p) against p, and reading the value where the minimum 
MS'E(p) is found. The exact value for p by this method is unlikely to be used. It is 
more common to use the standard transformations shown in Display 4.2, which 
approximate the values of the estimated p. 


4.7 Generalizing the Linear Model 


The linear model used in this book assumes the experimental errors have homoge- 
neous or constant variance throughout and that their distribution is well approxi- 
mated by the normal distribution. Frequently, we find in our studies that natural 
phenomena behave linearly over the range of interest and have errors that are 
homogeneous and normally distributed. If not, we often find that some suitable 
transformation of the response variable will provide the required linearity and error 
structure. This ability to use the linear model and analysis of variance so often has 
led to their popularity in scientific investigations. However, transformations can 
lead to unsatisfactory scales for interpretation (for example, arcsin of the square 
root). 

The generalized linear model is a general class of linear models introduced by 
Nelder and Wedderburn (1972) that broadens the available variety of probabilistic 
models for experimental errors and forms of nonlinearity in the model. The linear 
model used throughout this book with normally distributed and homogeneous 
errors is a subset of this generalized form. 

The generalized linear model introduces separate functions to allow for hetero- 
geneous variances and nonlinearity. Rather than transforming the response variable, 
the generalized linear model may be better described as reexpressing the model. 
Dobson (1990) provides a compact introduction to the use of and applications for 
the generalized linear model, while a thorough coverage of the models may be 
found in McCullagh and Nelder (1989). 

Recall linear models introduced in Section 2.4 for the observed response y, 
which is a realization of the random variable Y with expectation E(Y) = py, where 
tb = Bo + fir, +-:-+>z_ is linear in the @ parameters. The x variables repre- 
sent variables controlled in the study or measured as covariates of the response y. 
The requisite assumptions are a constant o? for Y and the linear relationship be- 
tween yp and the z;. However, for many types of data a change in the mean of Y 
introduces a change in the variance. Examples include the binary response (0 or 1) 
with probability of success 7 for the binomial distribution, for which E (Y) = 7 and 
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o7(Y) = 7(1 —z)/n, and count data for the Poisson distribution, for which the 
mean is equal to the variance. 3 

The generalized linear model handles these issues naturally by introducing a 
reparametrization to allow heterogeneous variance by introducing a link function 
n = g(u), such that 7 = g(u) = So + 6iz1 +--- + Bpxp is now linear in the 6 
parameters rather than jz. For example, the link function for binary responses with 
p = 7 Is the logit link 


n= gu) = iog( +] 


so that 


e!) 


ies 1+ e7 


The range of = 7 is still in the interval [0,1], but 7 = g() can take any real 
value. With the logit link function, it is the logit that is lmear in the 6 parameters 
rather than pz = 7. 

The natural link function for the Poisson distribution is the /og link with 
71 = log() so that log(js) is linearly related to the z;. The probit link is another link 
for binary data, which is widely used for biological assays. The identity link 7 = yu 
does no reexpression and results in the usual linear model with homogeneous 
variances and normally distributed errors. 

The emergence of readily available software has popularized the use of partic- 
ular links associated with the generalized linear model, such as logistic regression 
with the logit link and log linear models or Poisson regression with the log link. 
Collett (1991) provides comprehensive coverage on the use and application of links 
for binary data. 

Estimation and analysis with generalized linear models is based on maximum 
likelihood estimation methods. Coverage of the model estimation and other issues 
associated with generalized linear models is beyond the intent and scope of this 
book. Regardless, whichever model is used for statistical inference, good statistical 
design is fundamental to valid statistical inference. 


4.8 Model Evaluation with Residual—Fitted Spread Plots 


A graphic method, the residual—fitted spread plot, helps evaluate how well the 
hypothesized linear models fit the data. It has been included here as an addendum 
to the discussions on the evaluation of assumptions about homogeneous and nor- 
mally distributed experimental errors with the residuals. Cleveland (1993) covers 
other useful graphic methods that can be used for exploratory and confirmatory 
data analysis. 
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The residual-fitted spread plot, or r-f spread plot, provides a graphic portrayal 
of the relative variation or spread in the experimental errors and the fitted values? 
of the linear model. In the case of the completely randomized design the fitted val- 
ues are the estimated treatment group means. The r—/f spread plot provides a visual 
companion to the ratio (SSE, — SSE;)/SSEy, introduced in Section 2.10 as a 
prelude to the F’ test for differences among treatment group means. The ratio 
provides a means of assessing the relative improvement of the full model over the 
reduced model for the data, wherein (SS'E, — S'S E’;) reflects the variation attribu- 
table to the components estimated or fit in the model, and SS'E¥ reflects the varia- 
tion in experimental error or residual variation after the model has been fit to the 
data. 

The r— spread plot includes one plot of the sorted residuals versus their cumu- 
lative frequency and one plot of the sorted fitted values minus their mean versus 
their cumulative frequency. The cumulative frequency, or the fvalue, is the same, 
f, =(@ —0.5)/N,i=1,2,...,N, used to obtain normal quantiles for the normal 
probability plots introduced in Section 4.3. 

The calculated ratio for the meat storage experiment in Chapter 2 with 
SSE, = 33.7996 and SSE; =0.9268 is (33.7996 — 0.9268)/0.9268 = 35.47, 
indicating the sum of squares for the fitted-values treatment group means was 35.47 
times larger than the sum of squares for the residuals. The r—f spread plot for the 
meat storage experiment is shown in Figure 4.8. 

The larger spread of the fitted values relative to the spread of the residuals 
(Figure 4.8) reflects the large value for the ratio of their respective sums of squares. 
Too often statistical significance does not give a true indication of whether the 
treatments produce meaningful physical or biological differences. The r—/ spread 
plot provides some visual evaluation of the physical differences among the treat- 
ment groups relative to the leftover residual variability, which can be used as a 
supplement to the formal F’ test in the analysis of variance. Given the differential 
spreads of the fitted and residual values for the meat storage experiment, one may 
conclude that relative to the experimental errors some of the treatments did produce 
meaningful biological differences among their respective means. 

The transformed Hermit crab data provide a striking contrast to the meat 
storage study using the r— spread plot. The analysis of variance for the 
transformed data is given in Table 4.5. An F' test rejects the hypothesis of equal site 
means with Fy, =2.06/0.89=2.31 and a P-value of .046. The ratio 
(SSE, — SSEs)/SSE;s = 10.32/127.99 = 0.08 indicates a small amount of vari- 
ation among the fitted treatment means relative to the residuals. This conclusion is 
confirmed visually with the r—f spread plot in Figure 4.9. Although the site means 
are judged to be different by the F test, it is quite apparent that the residual 
variation within each of the sites is considerable relative to the differences among 
the averages of the transformed counts for the sites. Thus, the significance of the 
biological differences among the sites may be negligible. 


3 Fitted value stems from the notion that we “fit” a model to the data and decompose the observation 
yij into two parts as y;; = 4; + €;; in the completely randomized design, where the fitted value is 7; 
and the residual is €;;. 
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Figure 4.8 Residual—fitted spread plot to compare the spreads of the residuals and 
the fitted values minus their means for the meat storage experiment in Chapter 2 


Table 4.5 Analysis of variance for Hermit crab data transformed by zx = 
log(y + §) 


Source of Degrees of Sum of Mean 
Variation Freedom Squares Square 
Total 149 138.31 

Sites 5 10.32 2.06 
Error 144 12199 0.89 


Fitted 
0 
Residuals 
0 


0.0 0:26 0.4 0.6 0.8 1.0 


f-value f-value 


Figure 4.9 Residual—fitted spread plot to compare the spreads of the residuals and 
the fitted values minus their means after transformation, x = log(y + =)s for the 
Hermit crab data 


144 CHAPTER4 DIAGNOSING AGREEMENT BETWEEN THE DATA AND THE MODEL 


EXERCISES FOR CHAPTER 4 


1. A temperature-accelerated life test was performed on a type of sheathed tubular heater. Six heaters 
were tested at each of four temperatures: 1520°F, 1620°F, 1660°F, and 1708°F. The number of 
hours to failure was recorded for each of the 24 heaters in the study. 


Test Temperature Hours to Failure 
S20" 1953, 2135, 2471, 4727, 6134, 6314 
1620° 1190, 1286, 1550, 2125, 2557, 2845 
1660° 651, 837, 848, 1038, 1361, 1543 
1708° S11, 6544651, 652, 688, 729 


Source: W. Nelson (1972), A short life test for comparing a sample with previous 
accelerated test results. Technometrics 14, 175-185. 


a. Investigate the assumptions necessary for an analysis of variance of the data. Include a normal 
probability plot of the residuals and a spread-location plot. 

b. Determine a reasonable transformation from the ladder of powers, using the slope of the 
regression line based on the logs of the standard deviations and the group means. 

c. Determine if your choice for the transformation resulted in data that reasonably agree with the 
assumptions necessary for the analysis of variance. 

d. Conduct an analysis of variance of the transformed data, and partition the sum of squares for 
temperature into orthogonal polynomial contrasts to determine the best relationship between 
temperature and your response variable. Since test temperatures were unequally spaced, use 
the following contrast coefficients: 


Temperature 1520 1620 1660 = 1708 
Linear +Q.7/3 —0051 0.238 0.585 
Quadratic 0.382 —0.637 —0.328 0.583 
Cubic — 0.078 0.584 —0.765 0.259 


2. An entomologist counted the number of eggs laid by female moths on successive days in three 
strains of tobacco budworm (USDA, Field, and Resistant) from each of 15 matings. The data that 
follow are the number of eggs laid on the third day after the mating for each female in each of the 


strains. 7 
Strain Number of Eggs per Moth 
USDA 448, 906, 28, 277, 634, 48, 369, 137, 29, 522, 319, 242, 261, 566, 734 
Field 211. 276,415, 767.16, 118, 1151, 0, 253..61, 0.22 (oe, oo 
Resistant 0, 9, 143, 1, 26, 127, 161, 294, 0, 348, 0, 14, 21, 0, 218 


Source: Dr. T. Watson and S. Kelly, Department of Entomology, University of Arizona. 
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a. The entomologist wants to conduct an analysis of variance on the egg counts. What probability 
distribution is appropriate for the data? 7 
What is the suggested transformation for the probability distribution you named in (a)? 

c. Determine a reasonable transformation from the ladder of powers, using the slope of the re- 
gression line based on the logs of the standard deviations and the group means. Does the trans- 
formation arrived at by this method agree with your suggestion in part (b)? 

d. Transform the data with your choice of transformation, and determine if the transformed data. 
agree with the analysis of variance assumptions. 


A plant breeder evaluated the rooting capability of nine bermuda grass clones in a laboratory exper- 
iment. Two replications of each clone were grown in an aerated growth solution in a completely 
randomized design. The number of nodes that rooted on the stolons of each clone follow. 


Replication 1 Replication 2 
Clone Rooted Not Rooted Rooted Not Rooted 
1 o 49 1] 53 
2 13 51 11 53 
3 13 5] 6 58 
4 6 42 4 60 
5 16 48 12 52 
6 [4s 50 9 a9 
7 8 | 56 18 46 
8 9 bs 10 54 
9 8 40. 16 48 


Source: Dr. W. Kneebone, Department of Plant Sciences, University of Arizona. 


a. The plant breeder wants to analyze the proportion of rooted stolons or proportion of rooted 
nodes. What probability distribution is appropriate for the data? 
What is the suggested transformation for the probability distribution you named in part (a)? 

c. Transform the data for the proportion of rooted stolons (or nodes) with the appropriate trans- 
formation, and conduct the analysis of variance on the transformed data. 

d. Use the multiple comparisons with the best procedure to select the subset of clones with the 
largest means and P(CS) = 0.95. 


The Ames Salmonella/microsome assay is used to investigate the potential of environmental toxic 
substances for their ability to effect heritable change in genetic material. The compound 4-nitro- 
ortho-phenylenediamine (4NoP) was tested with strain TA98 Salmonella. The number of visible 
colonies was counted on plates dosed with 4NoP. Five dose levels of 4NoP were used in this study. 
The colony counts for seven of the plates at each dose level are shown. 
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Dose (g/plate) Colony Counts 
0.0 li, Peet. 17, 18,2). 25 
0.3 39, 43, 46, 50, 52, 61, 67 
1.0 88, 92, 104, 113, 119, 120, 130 
3.0 idee 5 eI. £0), £1, leat 
10.0 562, 604, 689, 702, 710, 739, 786 


Source: B. H. Margolin, B. S. Kim, and K. J. Risko (1989), The Ames Salmo- 
nella/microsome mutagenicity assay: Issues of inference and validation. Journal of 
the American Statistical Association 84, 651-661. 


Since the data involve counts of bacterial colonies, can you safely assume the data have a Pois- 
son distribution? Explain your answer. 

The authors of the cited article suggest the negative binomial distribution as a plausible distri- 
bution. Do you agree with this conclusion? Explain your answer. 

Determine a transformation for the data such that the analysis of variance assumptions are 
sufficiently satisfied by the transformed data. Conduct an analysis of variance for the trans- 
formed data. 


Given the following random sample of N = 15 observations that have been ordered from smallest 


to largest 


ee 


14.3, 16.6, 17.3, 175, 17.8, 18.7, 16.8, 69, 20.0, 20-8, 21.4, 22.7, 23 eT 


Determine the fvalues and their standard normal quantiles. 

Plot the observations versus the standard normal quantiles. 

Interpret the plot relative to the form of distribution from which the observations were 
sampled. 


Given the following random sample of N = 16 observations that have been ordered from smallest 


to largest 


2 (354, 5, 10,28; 34, 35539, 63707) 97; 41255156, 188, 253 


Determine the (values and their standard normal quantiles. 

Plot the observations versus the standard normal quantiles. 

Interpret the plot relative to the form of distribution from which the observations were 
sampled. 
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4A Appendix: Data for Example 4.1 


Hermit crab counts in coastline sites. A marine biologist counted Hermit crabs on 
25 transects in each of six different coastline sites. The number of Hermit crabs 
counted on each of the transects follows. 


Site Counts 

| 6.0 2). 3. 17.9.6, 7. 11,11, 72, 33, 0,95, 13, 44,20, 27, 48, 104, 253, 81,22, 9,2 
2 415, 466, 6, 14, 12, 0, 3, 1, 16, 55, 142, 10, 2, 145, 6, 4, 5, 124, 24, 204, 0, 0, 56, 0, 8 
2 0, 0, 4, 13, 5, 1, 1, 4, 4, 36, 407, 0, 0, 18, 4, 14, 0, 24, 52, 314, 245, 107, 5, 6, 2 

4 60.042 2 5.4.2, 1.9, 22, 1, 3003; 28,2, 21, $, 82, Ee, 10; 2,0 

5 G1. 1.2.2 4 22, Gs, 13, 0, 99, 2, 3, 26, 30,5, 4, 441, 9,3, 0 

6 0.0, 0 2 3, 0,0, 4,4, 5, 4, 22, 0, 64,4, 4, 43, 3, 16, 19,95, G, 22, 0, 0 


< Experiments to Study Variances 


In this chapter, the statistical model for research studies about the variances of 
populations is introduced. Knowledge about the assignable causes of variation is 
useful for improving manufacturing processes, improving the genetics of crops and 
livestock, enhancing quality control in the health industry, and designing research 
Studies. The objective is to decompose the total variance into identifiable 
components. 


5.1 Random Effects Models for Variances 
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The meat storage experiment from Chapters 2 and 3 included four specific treat- 
ments with no expressed interest in any other packagings for the experiment. Thus, 
the complete treatment population of interest consisted of the four packaging 
methods. 

Each of the four packaging methods could be duplicated if the experiment was 
repeated. Under these circumstances, the statistical models used for the studies are 
referred to as fixed effects models, and the inferences are restricted to the . particular 
set of treatments in the study. 

There are other types of research studies in which we want to identify the ma- 
jor sources of variability in a system and estimate their variances. By nature of (1) 
the research objectives, (2) the treatment structure, (3) the experimental protocols, 
and (4) the type of inferences made from the observed results, the effects in the 
model are considered to be random effects, and the statistical models are referred 
to as random effects models. The following example illustrates a system in which 
knowledge about variability in identifiable components of an industrial process can 
be used to improve the process product. 
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Example 5.1 Castings of High Temperature Alloys 


A metal alloy is produced in a high-temperature casting process. Each casting 
is broken down into smaller individual bars that are used in applications re- 
quiring small amounts of the alloy. The tensile strength of the alloy is critical 
to its intended future use. 

The casting process is designed to produce bars with an average tensile 
strength above minimum specifications. Some variation in tensile strength 
among the bars is acceptable when only a small proportion of bars do not 
meet specifications (Figure 5.1(a)). However, excessive variation results in an 
unacceptable proportion of bars that do not meet specifications (Figure 
5.1(b)). 


minimum minimum 


(a) acceptable variation (b) unacceptable variation 


Figure 5.1 Acceptable (a) and unacceptable (b) variation in tensile strength 


Two components contribute to the total variation in tensile strength of the 
manufactured bars: variability among fabrication castings and inconsistencies with- 
in the casting process that affect bars from the same casting. 


An experiment was planned to isolate the variation in tensile strength due to 
the effects of different castings from that attributable to inconsistencies within the 
same casting. | | 

High-temperature castings of the alloy were taken from three randomly select- 
ed fabrications conducted in the same facility. Each casting was broken down into 
individual bars. Destructive tensile strength measurements were obtained on a 
random sample of 10 bars from each of three castings. The tensile strength data for 
each of the 30 bars in pounds per square inch (psi) are given in Table 5.1. 

The three castings used in the study represent a sample of the potential popula- 
tion of castings that could be produced in the facility. The investigators were inter- 
ested in the variation in tensile strength among castings produced by the facility; 
thus, the concern was not with the three specific castings in the experiment. 

The investigators considered the castings only a random sample of three from a 
population of castings produced by the facility. The effects of the castings will be 
random effects since they are randomly selected from a potentially infinite popu- 
lation of castings. The inferences will extend to the population of castings that 


150 CHAPTERS EXPERIMENTS TO STUDY VARIANCES 


€ 


Table 5.1 Tensile strengths (psi) of bars from three separate castings of a high- 
temperature alloy 


Casting 
] 2 5 
88.0 BB 94.2 
88.0 88.6 91.5 
94.8 90.0 92.0 
90.0 87.1 96.5 
93.0 85.6 95.6 
89.0 86.0 93.8 
86.0 91.0 fae 
o29 89.6 oS 2 
89.0 93.0 96.2 
93,0 87.5 92.) 


Source: G. J. Hahn and T. E. Raghunathan (1988), Combining information from various sources: A 
prediction problem and other industrial applications. Technometrics 30, 41-52. 


conceivably could be produced in the facility. Likewise, the individual bars are a 
random sample of bars possible from a single casting, and their effects on tensile 
strength are random effects. 

The observed tensile strength of a particular bar (y) differs from the mean of 
the process (w) by some overall error, 6 = y — pp. The components of the overall 
error are illustrated in Figure 5.2. The overall error is the sum of two components, 
6 = 6, + 6), where 6, is the error component for castings and 6, is the error com- 


ponent for bars in a casting. 
ae. 


: BP 


¥ 


Casting variation 


< 0.— 


Bar variation 


Casting mean 


estore Bae ¥ 


Figure 5.2 Two components of error in the metal casting process 
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Other Examples: Another typical study with random effects involves the inheri- 
tance of quantitative traits, such as grain yield in cultivated plant species. 
Many genetically distinct families of the crop are developed in a plant-breeding 
program. The families represent a random sample of potential families that can be 
developed by the plant breeder. The progeny of each family are regarded as a ran- 
dom sample of the progeny possible from the family. The plant breeder wants to 
partition the total variation into the separate contributions from the families and the 
progeny. 

Clinical medical laboratories routinely participate in interlaboratory studies on 
the variability of assay results requiring random effects models for the statistical 
analysis. At regular intervals samples from a large homogeneous pool of serum are 
sent to a large number of laboratories for analysis. The participating laboratories 
and the samples sent to them represent a random sample of the potential popula- 
tions of laboratories and serum samples. The investigators want to know if there is 
significant variation in assay results among the laboratories. 


5.2 A Statistical Model for Variance Components 


A suitable model to identify the sources of variation for the random effects in the 
experiment on casting high-temperature alloys is 


Variance. Compovents (5.1) 
ede] 


the a; are the random cast ffects, and the ei; are 
the random error effects for ites “iis aunties mes fects e;; and a; are assumed 
to be independent o of o one pS 


with a mean of 0 ae variance a The random effects for the groups (a;) are a 
paca to _ a aah rib from a La with. a a mean bed 9 and \ variance a 


a wipe population of effects, the differences among the specific group means, 
jt+a;, are of no particular interest. The variance of the distribution of group ef- 
fects, 07, is the focus of interest with random effects. 

The variance of an Scien a any be expressed as the sum of the two 


2 eee 2 


iances ee ang a; are called components of 
variance, and ) often is referred to as a variance 
pte sane s me nee ear aS study the variance component among 
groups (07) represents genetic variation among families, and the plant-breeder may 
be interested in the ratio of this genetic variation to the total variation (02). The 


engineer may use the estimate of os to compute percentile values for the 


variances, or Oy = g +o. 
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distribution of tensile strengths of the bars when they are to be used in a critical 
application. 

An outline of the analysis of variance for the observations is shown in Table 
5.2 with the pent mean te for the Among Groups and Within sh 
mean squares. | IFOUPS @ SW 
of Treatments and E for the s of variat is 
fects model from the fixed effects model. The computations for fhe sums of 
squares for Among Groups and Within Groups are the same as those for the Treat- 
ment and Error sums of squares given in Chapter 2 for the fixed effects model. 


Table 5.2 Analysis of variance for the one-way classification with expected mean 
squares for the random effects model 


Source of Degrees of Sum of Mean Expected 
Variation Freedom Squares Square Mean Square 
Total Pes SS Total 

Among Groups t—1 SSA MSA o2+ro? 
Within Groups N-t SSW MSW o? 


5.3 Point Estimates of Variance Components 


The analysis of variance method is used to estimate the variance components. 
analysis of variance is computed as if the model is a fixed | ee model, and. 
Picket wih Joie Call a) 

model (Table 5. 2). The observed mean squares are estimates 7" the ed mean 
squares, or 


and (5.2) 


The analysis of variance estimators of the variance components are determined 
by solving Equations (5.2) for the two unknowns. The solutions are 


G2 =MSW 
and (5.3) 
=2 _ (MSA~- MSW) 


a 


r 


The estimators in Equations (5.3) are unbiased and they have the smallest vari- 
ance of all estimators, which are both quadratic functions of the observations and 
unbiased estimators of a2 and 0%. 
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The random effects of the model are assumed to have a normal distribution. 
Given the assumption of normally distributed effects, the significance of the 
Among Groups component of variance can be tested. The null and alternate hy- 
potheses are Hj: 02 =0 and H,: o2 > 0, respectively. The test statistic is 
fo = MS A/MSW, and the null hypothesis is rejected at the a level of signifi- 
cance if Fo > Fm fb-1\ 

The analysis of variance for the tensile strength data in Table 5.1 is shown in 
Table 5.3. The estimate of the components of variance for bars within castings is 
the mean square for Within Groups, or 


G2 = MSW =5.82 
The estimate of the Among Groups component of variance is 


9 (MSA— MSW) (7394-532) 
ee eee Se ee es 


== 6.51 
re r 10 


The estimated total variance of an observation on tensile strength is G2 = 


y 
G2 +62 = 6.81 + 5.82 = 12.63. 

The Fo ratio to test the null hypothesis Hp: 02 =0 is Fy = MSA/MSW 
= 73.94/5.82 = 12.71. The null hypothesis is rejected with a probability of ex- 
ceeding fo = 12.71 equal to .000 (Table 5.3). The castings variation contributes 
significantly to the variation in the tensile strengths of the alloy. 


Table 5.3 Analysis of variance for tensile strengths of bars from three castings of 
a high temperature alloy 


Source of Degrees of Sum of Mean 

Variation Freedom Squares Squares F | a od 
Total . 29 304.99 

Among Groups 2 147.88 73.94 Fert 000 
Within Groups 27 157.10 5.82 


9.4 Interval Estimates for Variance Components 


Confidence interval estimates can be computed for both variance components. The 
exact 100(1 — a)% confidence interval estimator for 0? is 


SSW Sd SSW 
A ‘ B 
where A = x /2,(N—t) and B = ee /2),(N-t): A and B are values of the chi-square 


variable exceeded with probabilities a/2 and (1 — a/2), respectively. Values of 
chi-square are found in Appendix Table III. 


(5.4) 
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An interval with at least 100(1 — 2a)% confidence for a? is 


SSAA Fel Ba)nccans, SCAG sxdhy/Fo) 
ee eg 
PC rD 

where C = X29 4-1)» D = X(1-a/2),(t-1)» and Fo = MSA/MSW is the observed 

Fo statistic (Williams, 1962). The quantities Fy = Fojo(t-1)(nw-t) and 

Fi = Fa-a/2),(t-1),(N-t) are values of the F’ variable exceeded with probabilities 

a/2 and (1 — a/2), respectively.! 

Given SSW = 157.10, x45 7 = 40.1, and xG5,o7 = 16.2, the 90% confidence 

interval estimate of o? from Equation (5.4) is 


ISTAMog Aye 1STHAD 
6 ae ee 


(5.5) 


592 < a7 < 9,70 


With SSA = 147.34, 7 = 10; X025.2 = 738, Xo75,2 =0.5, 5 = (2.71, 
F'o95,2,.27 = 4.24, and F'975,2.27 = 0.025, the 90% confidence interval estimate of o? 
from Equation (5.5) is 


Wage | — 0.025 
147.88 | ——#2-4 2 147,884 
10(7.38) | ~ 10(0.05) 


1.34 <0? < 295.18 


The interval estimates for variance components will be quite wide when mean 
squares have small degrees of freedom. More groups of castings would provide a 
more precise interval estimate of o?. 


Interpretations of the Variance Components 


The mean tensile strength for the experiment was 7 = 90.9 with standard error 


estimate 
MSA 73.94 ; 
sy =\/ og a ae 1.57 psi 


The estimated variance of an observation on tensile strength on bars is 


a, =G* +6. = 12.63 with a standard deviation of G, = ,/12.63 = 3.55 psi. The 
variance component estimates isolated the different sources of variation in the 
casting process for alloy bars: The variance among castings accounted for 54% of 


the variation, and the variance among bars within castings accounted for 46% of 


' The value of F(;~.) cannot be read directly from Appendix Table IV, but its value can be determined 
from the relationship F,,, 14-0) = 1/Finn,a: 
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the variation. The engineer can reduce the standard deviation of ¢, = 3.55 psi by 
identifying and adjusting factors in the casting process that increase variation. The 
variation among castings can be caused by inconsistent alloy mixtures or tempera- 
ture settings from casting to casting. The variation among bars within castings can 
be caused by inconsistent cooling conditions or variations in the tensile strength 
measurement procedure. 


5.5 Courses of Action with Negative Variance Estimates 


By definition, a variance component is positive. However, estimates of o? using 
Equation (5.3) may be negative. There are several suggested courses of action in 
the case of negative estimates (Searle, 1971); Searle, Casella, & McCulloch 
(1992). 


1. Accept the estimate as evidence of a true value of zero and use zero as the 
estimate, recognizing that the estimator will no longer be unbiased. 


2. Retain the negative estimate, recognizing that subsequent calculations using 
the results may not make much sense. 


3. Interpret the negative component estimate as indication of an incorrect 
statistical model. 


4. Utilize a method different from the analysis of variance for estimating the 
variance components. 


5. Collect more data and analyze them separately or in conjunction with the 
existing data and hope that increased information will yield positive estimates. 


Searle (1971, Chapter 9) and Searle et al. (1992, Chapter 4) discuss several meth- 
ods of estimation from the extensive literature on variance component estimation, 
as well as these other actions, in greater detail. 


5.6 Intraclass Correlation Measures Similarity in a Group 


The intraclass correlation coefficient is a measure of the similarity of observations 
within groups relative to that among groups. When the similarity of the observa- 
tions within groups is very high, o? will be very small. Consequently, 2 will be a 
larger proportion of the total variation (07 = 02 + a2). The intraclass correlation, 
defined as the ratio 


Te gna (5.6) 
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is used in various disciplines. Applications arise in genetics studies with various 
measures for the heritability of quantitative traits, in reliability studies to measure 
the similarity of products from the same machine or process, in medical studies to 
measure the repeatability of successive measurements on patients, and in survey 
sampling to measure the similarity of responses among people contacted by the 
same interviewer (Koch, 1983). 

The analysis of variance was introduced by R. A. Fisher in the 1920s with an 
intraclass correlation model (Fisher, 1960). The model assumes all observations 
(y;;) have the same mean (jz) and variance (a), and any two members of the same 
group have a common correlation (97). With this model the expected mean squares 
for the analysis of variance in Table 5.2 are 


E(MSA) = o? {1+(r— 1)pr} 
and (5.7) 
E(MSW) = 0°(1 — p7) 


The estimators of p; and o” are found by equating the observed mean squares to 
the expectations shown in Equation (5.7) and solving for the unknowns. The solu- 
tions are 


9 }©6{MSA+(r-1)MSW} 
ne 
rT 


and (5.8) 


_ (MSA-~ MSW) 
Pl (MSA +(r—-l)MSW} 


The estimate of the intraclass correlation can have a minimum value of — 1/(r — 1) 
and a maximum value of 1 (Fisher, 1960) because the expected value of MSA 
must be equal to or greater than zero. 

The 100(1 — a)% confidence interval estimator for p; is 


Ig = fy 3 fyi 
Fot+(r—1)Fu Fo +(r—-1)Fi 


where Fu = Fy /2(t-1),(N-1)> ia = F—a/2),(t—-1),(N —1)> and Fo = MSA/MSW. The 
interval may be used for testing the hypothesis Ho: pr = 0, where the hypothesis is 
not rejected if the interval includes zero (Koch, 1983). 

The estimate of the intraclass correlation for castings of high-temperature al- 
loys is 


(5.9) 


(73.94 — 5.82) 


ee 
PI ~ [73.94 + 9(5.82)} 


With Fo = 12.71, F'95,2,27 = 3.35. and F'95,2.27 = 0.051, the 90% confidence inter- 
val estimate is 


5.7 UNEQUAL NUMBERS OF OBSERVATIONS IN THE GROUPS 157 


CR ee EL 
{1274-49635} — "! ~ 112-71 +.9(0.051)} 


and 
0.22 < py < 0.96 


The interval does not include zero, and a null hypothesis of zero intraclass correla- 
tion among the bars within castings is rejected. 

The interpretation of intraclass correlation can be made on the basis of the ratio 
in Equation (5.6). The numerator (oc?) reflects the variation peculiar to the 
differences among groups, whereas the denominator variance (a? + a?) pertains to 
individuals sampled randomly from the universe of all groups without regard to 
group boundaries. 

If the intraclass correlation is large, all the individuals in the same group are af- 
fected alike by the random effect (a;) common to that group. Thus, the similarity 
among individuals within groups will be greater than that among individuals from 
different groups, and o? will be small relative to 0”. 

On the other hand, a small intraclass correlation indicates dissimilarity among 
individuals within groups with o? large relative to 02. For example, competition 
among plants or animals within a group for nutritional resources could lead to 
growth disparities within a group. This could happen if more vigorous or aggres- 
sive individuals took a greater part of the nutritional resource. 


5.7 Unequal Numbers of Observations in the Groups 


The random effects model for the one-way classification with unequal numbers of 
observations per group is 


Yij = UTA + i; (5.10) 


$= 00 fo =1,2;...,7; 


with the same assumptions and interpretations given for the random model in 
Equation (5.1). The analysis of variance computations are the same as those for 
the fixed effects model with unequal replication. The expected mean squares for 
Among Groups and Within Groups are, respectively, 


E(MSA) = 02 + 190° 
and | (5.11) 
E(MSE) =o: 


where 
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ro = — y = 2 (5.12) 
The analysis of variance estimators for the variance components o? and o? are 
G2 = MSW 
and (5.13) 
~2  (MSA-— MSW) 
ee 


When the r; are unequal the confidence interval estimator for 7? in Equation (5.5) 
no longer applies. The interval estimator for o? from Equation (5.4) with (N — t) 
degrees of freedom is valid. 


5.8 How Many Observations to Study Variances? 


The null hypothesis of interest in the random effects model, Ho: 027 = 0, is tested 
with F = MS.A/MSW, and the power of the test is 


1—B=P(F > Fay,»,|Mo false) = P(F > Fay,.|o, > 0) 


When o? > 0, the distribution of F’ is the central F,,,, distribution multiplied by a 
constant 1/\?, where 


(5.14) 


The power of the test can be determined from the central F’ distribution as 


] 
oo B = P| > 53 (Foon) 


Given the number of groups (t), significance level (a), desired power (1 — #), 
and A, the required replication numbers can be determined from charts of power 
curves similar to those for the fixed effects model. 

A value for \ may be determined on the basis of a desired ratio for the variance 
components, o2/o2, or on the basis of the standard deviation of an individual 
observation, o,. Consider Example 5.1 in which the engineer manufactured several 
bars of a high-temperature alloy in each of several castings. If there was no varia- 
tion in the strength of the bars due to castings, then the standard deviation of a bar 
selected at random would be o, = o,. The engineer may want to detect an increase 
in the variability among castings (o2) that causes a certain percentage increase in 
oy. Suppose P is the fixed percentage increase in o, that is acceptable, and beyond 
which the null hypothesis would be rejected. The ratio of o, to a expressed in 
terms of P when the null hypothesis is rejected is 
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2 2 
Cte. Na Ee nia GR 


Oe Oe 


The necessary value for the ratio 0?/c? in Equation (5.14) is 
© 
a =(1+0.01P) —1 (5.15) 


Charts of power curves are given in Appendix Table X for a = .05, .01, and se- 
lected values of 1, and v2 for the F' distribution. The charts plot the power of the 
test, 1 — G, versus \, where )? is given in Equation (5.14). 


Example 5.2 Castings of High-Temperature Alloys Revisited 


In Example 5.1 the estimates of the variance components were G- = 5.82 and 
G- = 6.81. The estimated standard deviations are G. = 2.41 and G, = 3.55. 
Suppose the engineer is able to run t = 5 castings and wants to detect an 
increase in the standard deviation o, over o, of P = 35 with a power of at 
least .80 at the .05 level of significance. The required value for the ratio in 
Equation (5.15) is 


= [1 + 0.01(35)]? — 1 = 0.8225 


nes | ats 


so that A = ,/1 + 7r(0.8225). If a value of r = 10 is chosen, then A = 3. 
Entering Appendix Table X for 1, = 4 and locating the approximate position 
of the line for 12 = 45 with a = .05 the value of 1 — @ is between 0.8 and 
0.9. Therefore, the engineer could measure ten bars in each of the five cast- 
ings to detect an increase of 35% or more in the standard deviation due to the 
castings. 


5.9 Random Subsamples to Procure Data for the Experiment 


It is sometimes necessary or convenient to randomly sample subunits of the experi- 
mental units to procure the requisite data for a study 2 i 1e Observational unit in this 
case is a subsample taken from a larger experimental ‘unit. Several plants may be 


sampled from a field plot for measurements of insect js feahasion. A serum sample 
is frequently split into two or more subsamples prior to spectrophotometric analy- 
sis. Several samples of paint are extracted from replicate batches of each paint 
— to test wih esti 


n ource of variability for the obser- 

per Sirientat units. It is important to distin- 
ate eianeti the variation satnied a0 subsamples and that contributed by the 
experimental units. This distinction becomes important to estimation of standard 
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errors for treatment means and tests of hypotheses about treatments. An introduc- 
tion to this distinction was given in Examples 1.1 and 1.2 in the discussion on 
replication in Chapter 1. 

Estimates of the variance components for experimental units and for subsam- 
ples identify the amount of variation contributed by the two sources. This informa- 
tion is used in Section 5.10 to determine the relative number of experimental units 
to minimize the standard error of the treatment means or the cost of the experiment. 


Example 5.3 Pesticide Residue on Cotton Plants 


Applications of pesticides are often part of insect management programs used 
for agronomic and horticultural crops. One of the concerns following applica- 
tion of pesticides is the concentration of pesticide residue that remains on the 
plants in the field after certain periods of time. Pesticide residues are evalu- 
ated with chemical assays in the laboratory using plants sampled from field 
plots treated with the pesticide. 


Research Hypothesis: For one particular problem the investigators hypothe- 
sized that the ability to recover pesticide residue on cotton plant leaves dif- 
fered among two standard chemistry methods that were being used on a 
regular basis for the residue assays. 


Treatment Design: The treatments consisted of the two standard chemistry 
methods, methods A and B, that were used on a regular basis. 


Experiment Design: Six batches of plants, each batch from a single field 
plot, were sampled from the field and prepared for residue analysis. Three 
batches were randomly allocated to each of methods A and B in a completely 
randomized design. | 

The amount of plant material in each batch sampled from the field ex- 
ceeded the amount required for an assay in the laboratory by either chemistry 
method. Thus, two subsamples of the required quantity of plant material were 
taken from each batch of the prepared plant tissue and analyzed by the appro- 
priate method. Consequently, there were two chemistry methods, three 
batches (replications) per method, and two subsamples from each batch. The 
pesticide residues determined for each of the subsamples as micrograms per 
unit of weight are shown in Table 5.4. 


The Statistical Model with Subsamples 
When there are n subsamples from each of r experimental units for ¢ treatments the 
statistical model is 


(5.16) 


iweb, 227 Sy bi59 2, can Soe Se. , 0 


where pu is the general mean, 7; is the fixed effect of the ith treatment, e;, is the 
random experimental error effect for the 7th experimental unit of the zth treatment, 
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Table 5.4 Pesticide residue (jg) found on samples of cotton plants 


Method A Method B 
Batch Sample Yijk Vij, Yj. Batch Sample Yije Yi; Yi. 
I | 120 + 7 71 
2 110 115.0 8 TTL 
2 3 120 5 9 70 
- 100 110.0 10 1 73,0 
3 5 140 6 1] 63 
6 130°" 13350 120.0 12 68 65.5 69.8 
: y= 9a9 


Source: G. Ware and B. Estesen, Department of Entomology, University of Arizona. 


1;;, is the random effect for the kth subsample of the jth experimental unit of 
the 2th trea Pent. Tt is assumed that the e;; and d;,, are veauhs vig distributed inde- 
nents random effects with means 0 and variances o? and o%, respectively. If 
treatments are random, then the fixed treatment effects oad in Equation (5.16) are 
replaced by the random group effects (a;), which have a normal distribution with 
mean 0 and variance o?. 


The Analysis of Variance with Subsamples 


The observations, expressed as deviations from the grand mean, can be written as a 
sum of three separate deviations that represent the sources of variation in the 


experiment: ails plies on 
Set Gia 04 (5.17) 


The deviation of any observation from the grand mean shown on the left-hand side 
of Equation (5.17) is the sum of three terms. They are the 


e treatment deviation (7, —J_)[ e.g. (J. —¥.) = 120.0 — 94.9 = 25.1] 

e experimental error (¥,;, — 9;,.) [e-g.. Gir, — Hi,,.) = 115.0 — 120.0 = — 5.0] 

e sampling error (yi; — 9,;.) [e-g., (yi11 — Ys.) = 120.0 — 115.0 = 5.0] 
Squaring and summing both sides of Equation (5.17) results in the total sum of 
squares on the left-hand side expressed as a sum of the sums of squares for treat- 


ments, experimental error, and sampling, respectively. The fundamental partition 
of the total sum of squares is 


SS Total = S'S Treatment + SS Error + SS Sampling (5.18) 


The sums of squares partitions are summarized in the analysis of variance shown in 
Table 5.5 with expected mean squares for the model with fixed treatment effects. 
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Table 5.5 Analysis of variance for the completely randomized design with sub- 


samples? 

“Sourceof -—s»dDegreesof § Sumof | Mean ~*Expected—S 
Variation Freedom Squares Square Mean Square 
Total trn — 1 SS Total 
Treatments t—1 AY 2 MST o4 +no? + rné? 
Error t(r — 1) SSE MSE o4+no? 
Sampling tr(n — 1) SSS MSS a 


t 
SST = SS Treatment = rn 2 YY, —¥ 


t r 
SSE = SS Error = ee 2, Big > 


ct el 


ee 
SSS = SS Sampling = 2X a 


Two sources contribute to variation in the observations that make up the esti- 
mate of a treatment mean: the variation among the replicate experimental units 
treated alike (c”) and the variation among the sampling units within the same ex- 


perimental units (03). items the variance of a treatment mean is 


(5.19) 


when there are n subsamples from each of the r replicate experimental units. The 
standard error of any treatment mean is estimated by 


sy, = soa (5.20) 


TT 


_ The eee of variance for a data in Table 5.4 is shown in Table 5.6. The 
1] o differences among treatment effects, Ho: 7 = 0, is rejected if 
T r | M SE D excesds Fe (t- a eli. The Fp statistic? in Table 5.6 to test the 
null hypothesis of no Teer between the means of methods A and B, Fo 

= 7550.08/190.08 = 39.72, is exceeded with probability .003. Since the mean of 
method A, ¥, = 120, exceeds that of method B, y. = 69.8, it may be concluded 


that method A recovers more of the pesticide residue than method B. 


2 The sum of squares for “Error” represents the sum of squares for experimental units nested within the 
treatment groups. A computer program will require a term in its syntax that designates the experimental 
units within treatments. 

3 Frequently, it is necessary to specify the correct denominator for the Fo statistic in the instructions to 
a computing program. By default, many programs utilize the last partition sum of squares for the 
denominator of the Fo statistic for all lines of the analysis of variance table. 
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Table 5.6 Analysis of variance for pesticide residue from subsamples of cotton 


plants 
Source of | Degreesof Sumof Mean 
Variation Freedom Squares Square a Prof 
Total 1] 8640.91 
Methods l 7550.08 7550.08 39.72 .003 
Error 4 760.33 190.08 
Sampling 6 330.50 55.08 


The standard error estimate for a method mean, Equation (5.20) is 


190.08 
Sy, = 6 = 5.63 


and the standard error estimate of the difference between the two method means is 


2(190.08) 
59, — 4.) ae 6 = 7.96 


5.10 Using Variance Estimates to Allocate Sampling Efforts 


The distribution of resources at the planning stage of an experiment involving sub- 
samples requires decisions regarding the number of experimental units to use and 
the number of subsamples to take from each experimental unit. The objective is to 
have a design that results in greater precision—a smaller variance for the estimate 
of a treatment mean (oz ) for a fixed cost. When estimates of the variance compo- 


nents and relative costs of the experimental and sampling units are available it is 
possible to provide an optimum allocation of effort between experimental units and 
sampling units in the experiment. 

Cochran (1965) provided an optimum allocation solution based on the cost 
function C = cir + corn. The value of C is the cost for a single treatment in the 
experiment composed of r experimental units each at a cost of c; and rn sampling 
units each at a cost of cp. The objective may be posed as an attainment of minimum 
cost (C) for a fixed variance in Equation (5.19), or the attainment of a minimum 
variance for a fixed cost. Either way the solution for the number of sampling units 


(n) is 


B 
n= 4/74 (5.21) 
C206 


The value of r is found by solving the cost equation for r if the cost is fixed or by 
solving the variance Equation (5.19) if the variance is fixed. 
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Example 5.4 Pesticide Residue Revisited 


An optimum allocation for plots of cotton plants and subsamples per plot is 
required for pesticide residue studies. The estimates of the variance com- 
ponents obtained from Table 5.6 are 


Gi = MSS = 55.08 
and 


~g (MSE-—MSS) — (190.08 — 55.08) 
C ee nein, ey OU 

n es 
Suppose the cost of one plot is c; = 1.0 relative to the cost of preparing and 
analyzing one subsample, cy = 0.1. The estimated number of subsamples per 
plot is 


1(55.08) 


a = RG 
0.1(67.50) 


Three subsamples would be required from each plot. If the investigator de- 
sired a standard error for the treatment mean of oz, = 3 or a variance of 9, 


the number of required plots r can be found from the substitution of the re- 
quired quantities in Equation (5.19). The substitutions are 


G? =9, 6 = 67.5, 7; = 55.08, and n = 3 


Yi.. 
so that 


55.08 67.5 2. 
+ —-; 9r= 85.86; r= 9.54 
r-3 at ae 


O= 


The investigator would have to use ten plots with three subsamples per plot to 
have a standard error of treatment means equal to 3 with relative costs of 1.0 
and 0.1 for plots and subsamples, respectively. 


5.11 Unequal Numbers of Replications and Subsamples 


Unequal subsample and replication numbers can occur in a study. The three differ- 
ent possibilities are (1) unequal numbers of experimental units per treatment with 
unequal numbers of subsamples per experimental unit, (2) equal numbers of ex- 
perimental units per treatment with unequal numbers of subsamples per experi- 
mental unit, and (3) unequal numbers of experimental units per treatment with 
equal numbers of subsamples per experimental unit. Any imbalance in the number 
of observations at the experimental unit or subsample stage affects the computa- 
tions for the analysis of variance and the expected mean squares. 
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Example 5.5 _ Biology of the Tobacco Budworm 


Populations of insects often develop resistance to the toxic effects of an 
insecticide after long-term exposure to the insecticide. When this resistance 
develops the insecticide is no longer effective to control the population below 
levels harmful to the crop. | | 

Populations of the tobacco budworm, an insect pest harmful to the cotton 
plant, have developed resistance to a number of common insecticides. The in- 
secticides are one component of the overall program of insect control in 
crops. Other components of the control program are also dependent on the 
biology of the insects in terms of their reproductive life cycle and develop- 
mental patterns. 


Research Hypothesis: Entomologists hypothesized that the development of 
insecticide resistance could also affect other aspects of the tobacco bud- 
worm’s biology. If this were true, then the changes in the insect’s biology 
would have an effect on tobacco budworm control programs. 


Treatment Design: The treatments used to address the research hypothesis in- 
cluded three strains of the tobacco budworm: (1) USDA, a strain very suscep- 
tible to a pyrethroid insecticide; (2) Resistant, a strain quite resistant to the 
insecticide; and (3) Field, a naturally occurring strain collected in a local cot- 
ton field. Both the Resistant and USDA strains were populations maintained 
in artificial environments to sustain their resistance characteristics. Any dif- 
ferences in the biology of the two strains were considered reflective of 
changes associated with developed insecticide resistance. The biology of the 
naturally occurring Field strain served as a control treatment for this supposi- 
tion. One of the characteristics measured to evaluate the biology was the 
weight of male larvae. 


Experiment Design: Six random matings between female and male moths 
were made from each of the strains, and the offspring from each mating were 
reared in separate enclosures in the laboratory. The 18 enclosures were placed 
randomly within the rearing facility. Unequal numbers of offspring resulted 
among the 18 matings as shown with the data listed in Table 5.7. 


The Statistical Model and Analysis 


Suppose the experiment has r; experimental units for the ith treatment group and 
ni; subsamples for the 7th experimental unit of the zth treatment. The statistical 
model for unequal subsamples is 


Yijk = MT + Cig + dizer (5.22) 


$=1,2,...,0 fH1j20.jn  BH1,2,0..,my 
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Table 5.7 Weight of male larvae from six matings in each of three tobacco 
budworm strains 


Strain Mating Weight Neg Ni. Vig. Yi; 

USDA | 305, 300 2 302.5 
376, 363, 389 3 376.0 
3 282 l 282.0 
4 309, 321 2g 315.0 
5 354, 308, 327 3 329.7 

6 330 | 2 330.0 330.3 
Field 7 280 | 280.0 
8 311, 349, 291, 286 4 309.3 
9 377, 342 2 359.5 
10 346, 340, 347 3 344.3 
11 360 l 360.0 

2 359, 299 2 13 329.0 329.8 
Resistant 13 27935; 216 Z Pg. ee 
14 212,200 2 2625 
15 3135; 2625297 5 291.3 
16 323 l 323.0 
17 Zoe 1 2520 

18 319, 298 Z 11 308.5 285.5 

y =3164 


Source: Dr. T. Watson and S. Kelly, Department of Entomology, University of Arizona. 


where pu is the general mean, 7; is the fixed effect of the 2th treatment, e;; is the 
random experimental error effect for the jth experimental unit of the ith treatment, 
and d;;;, is the random effect for the kth subsample of the jth experimental unit of 
the ith treatment. We assume the e;,; and d;,;, are normally distributed independent 
random effects with means 0 and variances o? and 03, respectively. If treatments 
are random, then the fixed treatment effects (7;) in Equation (5.22) are replaced by 
the random group effects (a;), which have a normal distribution with mean 0 and 
variance 0°. 

The sums of squares partitions for the analysis of variance and expected mean 
squares for random treatment effects are shown in Table 5.8. If treatment effects 
are fixed, replace c.a2 with 5* n;; (4; — @)?/(t — 1). 

Notice that the squares of the treatment mean deviations in SST are weighted 
by the number of observations on the treatments, n;,, and the squares of the experi- 
mental error deviations in SSE are weighted by the number of subsamples for the 
experimental units, n;;. The coefficients for the variance components in the ex- 
pected mean squares with random treatment effects are 
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Table 5.8 Analysis of variance for the completely randomized design with 
unequal numbers of replications and subsamples 


Source of Degree of Sum of Mean Expected 

Variation Freedom Squares Square Mean Square 
Total N-1 SS Total 
Treatments t-—1 SST MST 04 +c\0? + e920? 

t 
Error Sor, —t SSE MSE o4+c307 
i=1 
t 

Sampling N — Yor; S55 MSS oF 


r% 
mu. =>) Nig 
j=1 


SS Total = 3 st 3 (yisk — 9.) 


=e 


S&T = 3 ng. - oY 


er 


SoE = BO — ¥i,.)° 
reas ie | 
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(5.23) 
where 


Sa 


t V7 n?. t T; E 
A =) ee ad De on? 
1 jai \ i=1 i=l 
When the number of subsamples are equal for each of the experimental units, 


then n;; = n for all 2 and 4; the coefficients are 


l 2 
C126 = w and fos vy iy _ x (5.24) 
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When c; = c3 = n the expected mean squares for Treatments and Error will be 
identical under the null hypothesis Ho: 7; = 0, and the statistic Fy = MST/MSE 
is used to test the hypothesis. 

The analysis of variance for the data is shown in Table 5.9. The expected mean 
squares for analysis of variance are also shown in Table 5.9 for random treatment 
effects. The calculations of the coefficients for the variance components are shown 
in Appendix 5A. 


Table 5.9 Analysis of variance for male larval weights from six matings in each 
of three tobacco budworm strains 


Source of | Degrees of Sum of Mean Expected 
Variation Freedom Squares Square Mean Square 
Total 23: 46516.75 

Strain Z 15187.05 7593.52 03 +2,3602 + 11.970? 
Error 15 23082.45 1538.83 03+ 1.930? 

Sampling 18 8247.25 A58.18 oF 


Tests of Hypotheses Require Approximate F Tests 


When the subsample numbers are not equal, c,; and cz can have different values. 
There is no exact test of the null hypothesis for treatment effects because no two 
mean squares have the same expected mean squares under the null hypothesis if c; 
and c3 have different values. An approximate Fo statistic can be calculated to test 
the null hypothesis of no treatment effects when c; 4 c3. An approximate test is 
necessary for the tobacco budworm experiment since c; = 2.36 and c3 = 1.93 in 
Table 5.9. 

A Mean Square for Error is devised with an expectation equal to that of the 
Mean Square for Treatments, given a true null hypothesis with E(MST') = 
03+ c\0?. 

The required mean square is constructed with a linear function of MSS and 
MSE as 


M=a,\MSE+a.MSS (5.25) 


If ay = c,/cz and ay = 1 —c;/c3, the expected value of M will be o4 +c,02 as 
required for the approximate Fo test. 


The Satterthwaite Approximation for Degrees of Freedom 


Satterthwaite (1946) derived the following result for a linear function of mean 
squares. Given a linear function //, where 


M =a,MS, + a.MS,+---+a,MS; (5.26) 
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and MS, MSo,... , MS; are mean squares with degrees of freedom 14, v2, ... Vp, 
respectively, the degrees of freedom for M are approximated by 
M2 
a 


i=l VY; 


The linear function of mean squares necessary to test the hypothesis of no 
difference in mean larval weights among the three tobacco budworm strains 
requires 


2.36 
C3 1.93 C3 


MSE = 1538.83 with 15d.f. and MSS = 458.18 with 18 df. 
From Equation (5.25) 
M =a,MSE+a2MSS = 1.22(1538.83) — 0.22(458.18) = 1776.57 
The degrees of freedom for M from Equation (5.27) are 


é 1776.57 
’ = 1122(1538.83))2 | — 0.22(458.18) 
15 - 18 


= 13.4 


The truncated value of v = 13 is used as the degrees of freedom for M. The value 
of the test statistic is Fy = MST/M = 7593.53/1776.57 = 4.27. The critical 
value at the a = .05 level of significance is F'o5,.2.13 = 3.81, and the null hypothesis 
is rejected. There are some differences among the mean larval weights of the three 
strains. The test is only approximate, and it should be noted that the approximation 
is degraded somewhat if some of the coefficients (a;) in Equation (5.26) are 
negative. 


EXERCISES FOR CHAPTER 5 


1. A genetics study with beef animals consisted of several sires each mated to a separate group of 
dams. The matings that resulted in male progeny calves were used for an inheritance study of birth 
weights. The birth weights of eight male calves in each of five sire groups follow. 


170 CHAPTERS EXPERIMENTS TO STUDY VARIANCES 


C. 
d. 


Sire Birthweights 

ag GT, 100, 56, 113; 99; 103, 73562 
200 73, 102, 95, 103; 98, 115, 98, 94 
201 58, 60, 60, 57, 57, 59, 54, 100 
202 I 36, 67, 99, 98, 121, 101, 101 


203 39, 46, 120, 115, 115, 93,105, 75 


Source: Dr. S. DeNise, Department of Animal Sciences, 
University of Arizona. 


Assume a random model for this study. Write the linear model, explain each of the terms, 
compute the complete analysis of variance, and show the expected mean squares. 

Estimate the components of variance for sires and progeny within sires, and determine the 
90% confidence interval estimates. 

Test the null hypothesis Ho: 0? = 0 for the sires. 

Estimate the intraclass correlation coefficient, and give the 90% confidence interval estimate. 


The data from Exercise 3.5 are cholesterol concentrations from laboratory analyses of 2 samples 


from each of 8 patients. 


a. 


b. 


C. 
d. 


Assume a random model for the study. Write a linear model, explain each of the terms, 
compute the analysis of variance, and show the expected mean squares. 

Estimate the components of variance for patients and samples and determine the 90% confi- 
dence interval estimates. 

Estimate the intraclass correlation coefficient and give the 90% confidence interval estimate. 
What is the interpretation of the intraclass correlation coefficient in this study? 


Think of research problems in your field of interest for which the treatments in the study could be a 


random sample from a large population of treatments. 


a. 
> 
c. 


- 


Describe a particular study you could conduct. 

Describe how you would conduct the study. 

Write the linear model for your study; identify the terms; and write out the analysis of variance 
table showing sources of variation, degrees of freedom, and expected mean squares. 

Explain why it would be important to know the magnitude of the Among Group and Within 
Group components of variance. 

Describe how you would use estimates of the components of variance. 

What assumptions do you have to make about your study to have valid inferences from your 
variance component estimates? 


A plant pathologist took four 3-pound samples from 50-ton lots of cottonseed accumulated at 


various cotton gins during the ginning season. The samples of seed were analyzed in the laboratory 
for Aflatoxin, which is a toxin produced by organisms associated with the seeds. The Aflatoxin 
concentrations in parts per billion for samples from eight lots of cottonseed follow. 
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Lot Number Aflatoxin (ppb) 
3469 — 72 39, 57, 63, 66 
3849 — 52 $6)513%25,°31 
3721 — 24 64, 83, 88, 71 
3477 — 80 29 S59 2TH S51 
3669 — 72 38, 66, 53, 81 
3873 — 76 11, 49, 34, 10 
3777 — 80 Peas ie ae = 5. 

3461 — 64 10. $1, 25, ol 


Source: Dr. T. Russell, Department of Plant Pathology, Univer- 
sity of Arizona. 


Assume lots and samples within lots are random effects. Write the linear model for the study, 
explain the terms, compute the complete analysis of variance, and show the expected mean 
squares. 

Estimate the components of variance for lots and samples within lots. 

What is the total variance estimate (G,) for an individual observation? 

What proportion of the total variation Cr) in Aflatoxin can be attributed to variation among 
lots and samples within lots, respectively? 

What is the standard deviation estimate (¢, ) for an individual observation? 

Explain how the variance component estimates might be used to plan future sampling for Afla- 
toxin contamination. 


Think of research problems in your field of interest that require you to take samples of the experi- 
mental (or observational) unit because the unit cannot be measured in its entirety. 


a. 
b. 
C. 


d. 


Describe a specific study you could conduct. 

Describe how you would conduct the study. 

Write a linear model for your study; identify the terms; and sketch the analysis of variance 
showing sources of variation, degrees of freedom, and expected mean squares. 

What would be the relative costs for experimental units (c,;) and sampling units (cz)? 


A study was conducted on high-energy particulate cartridge filters used with commercial respira- 
tors for protection against particulate matter. One particular test included three filters randomly se- 
lected from each of two manufacturers. Three independent replicate tests were made on each of the 
filters. The measurements were the percent penetration by a standard type of test aerosol. 


Manufacturer | Manufacturer 2 
Filter ] a 3 4 5 6 
P12 0.16 UE 0.91 Les See ee we 
Lid. ad 0.12 0.83 0.83 Log 
1 ie 0.26 0.12 0.95 0.61 1.58 


SES eee omen TS ee SAR SM enn 2 PRUE yy et TAO cena aE RASS rN Se ee ee Se eee ene ree ee eee) ESTES 
Source: R. J. Beckman and C. J. Nachtsheim (1987), Diagnostics for mixed-model 
analysis of variance, Technometrics 29, 413-426. 
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a. Write a linear model for this study, explain each of the terms, compute the analysis of vari- 
ance, and show the expected mean squares. 

b. Test the hypothesis that there is no difference between the average percent penetration of the 
filters for the two manufacturers. 

c. Compute the means, their standard errors, and the 95% confidence interval estimates of the 
means for each of the manufacturers. 

d. Suppose the relative costs, c,:co, for the study are 200:1, where c, is the cost of a filter and co 
is the cost of an independent filter test. The engineers wanted to achieve a standard error for a 
mean of 0.20. How many filters and how many tests per filter would be required? 


7. A soil scientist studied the growth of barley plants under three different levels of salinity in a con- 
trolled growth medium. There were two replicate containers for each treatment in a completely ran- 
domized design and three plants were measured in each replication. The data on the dry weight of 
the plants in grams follow. 


Salinity Container Weight(g) 
Control l fied, tue, (1.Fo 
2 tds OI, 8.50 
6 Bars 3 5.64, 5.98, 5.69 
4 4.20, 3.34, 4.21 
12 Bars 5 4.83, 4.77, 5.66 


6 Jie, 2.09 


Source: Dr. T. C. Tucker, Department of Soil and Water Science, 
University of Arizona. : 


a. Write a linear model for an analysis of the data, explain the terms, compute the analysis of 

variance, and show the expected mean squares. 

Test the hypothesis of no difference among the means of the salinity levels. 

Compute the standard error of a salinity level mean. 

d. Partition the sum of squares for salinity into two orthogonal polynomial sums of squares 
(linear and quadratic), each with 1 degree of freedom, and test the null hypotheses of no linear 
or quadratic regression. 

e. Suppose the relative costs, c,:c2 are 10:0.1, where c, is the cost of sates up and maintaining 
another replicate container and c» is the cost of measuring the weights in a container. How 
many replicate containers and plants per container would be necessary to achieve a standard 
error for a treatment mean of 0.75? 


ie 


8. The porosity index is a measure used by soil scientists to assist in the prediction of water move- 
ment, storage, availability, and aeration conditions of soils. A soil scientist utilized a special sam- 
pling design to take soil samples from one of the university experiment farms to measure the 
porosity index of the farm soil. The farm was partitioned into fields of approximately 4 hectares 
each divided into eight sections. The sampling plan included a random selection of fields from 
which sections were randomly selected. Locations for soil subsamples were randomly selected 
within the sections. A special staggered sampling design from Goldsmith and Gaylor (1970) was 
utilized for the study. The porosity index of each soil subsample follows. 
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Field Section Porosity Field Section — Porosity 
l 1 3.846, 3.712 9 17 5.942 
2 5.629, 2.021 18 5.014 
2 3 5.087 10 19 5.143 
4 4.621 20 4.061 
3 2 4.41] 1] 21 3.835, 2.964 
6 Re # ZZ 4.584, 4.398 
4 : 3.991 12 23 4.193 
| 8 5.766 24 4.125 
5 9 5.0/7 13 = 3.074 
10 o.353 26 3.483 
6 he A 355, 6,292 14 27 3.867 
ee 4.940, 4.810 28 4.212 
7 13 2.983 15 29 6.247 
14 4.396 30 4.730 
8 15 5.603 
16 3.683 
Source: Dr. A. Warrick and M. Coelho, Department of Soil and Water Science, University of 
Arizona. 


a. Assume all effects are random. Write a linear model for the study, explain each of the terms, 
compute the analysis of variance for the data, and show the expected mean squares. 
. Estimate the components of variance for fields, sections, and samples. 
c. Test the null hypothesis Ho: c? = 0 for the fields’ component of variance. 
d. Test the null hypothesis Hp: 0? = 0 for the sections’ component of variance. 


Use the data from Exercise 5.3 to determine how many samples the plant pathologist would have to 
take from each of five lots of cottonseed to detect a ratio 0?/o? = 2 at the .01 significance level 
with a power of .90. 3 
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5A Appendix: Coefficient Calculations for Expected 
Mean Squares in Table 5.9 


fe - |e ee oe fe +e te 
12 13 11 


= 7.11655 


BiG tee 
i=1 j= 


t 
D= done = 1274 132411? = 434 


— — pemaraay —— 36 = x 
— ISG — 97 


oe ~~ : (N= Ay= 75 (36 — 7.11655) = 1.93 
> ri-t 
i=1 


C] 


C2 


6 Factorial Treatment Designs 


The factorial treatment design was introduced in Chapter 1 as a way to investigate 
ae ee. | the relationships among several types of treatments. The basic factorial treatment 
¢ design in a completely randomized experiment design and its analysis are intro- 
3 duced in this chapter. Planned contrasts and response curve estimation, discussed 

in Chapter 3, are applied to the factorial treatment design. Methods to determine the 

number of required replications and to analyze the factorial treatment design with 
one replication or unequal treatment replications are discussed as well. 


6.1 Efficient Experiments with Factorial Treatment Designs 


Comparisons among treatments can be affected substantially by the conditions 
under which they occur. Frequently, clear interpretations of effects for one treat- 
ment factor must take into account the effects of other treatment factors. A special 
type of treatment design, factorial treatment design, was developed to investigate 
more than one factor at a time. 

Factorial treatment designs produce efficient experiments. Each observation 
supplies information about all of the factors, and we are able to look at responses to 
one factor at different levels of another factor in the same experiment. The response 
to any factor observed under different conditions indicates whether the factors act 
on the experimental units independently of one another. Interaction between factors 
occurs when they do not act independently of one another. 


Asphalt pavements undergo water-associated deteriorations such as cracking, 
potholes, and surface raveling. The weakened pavement occurs when there is 
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a break in the adhesive bond between aggregate and the asphaltic cements 
that make up the pavement. Research is directed to find improved asphalt 
pavements more resistant to deterioration. 

The ability to develop superior asphalt pavement mixes requires a reliable 
method to test the experimental mix for bonding strength. Several methods 
have been developed to compact asphaltic pavement specimens in prepara- 
tion for bonding strength tests. 


If two compaction methods pro- 
duce the same relative results for strength tests with two different aggregate 
types, then either compaction method could be used to evaluate experimental 
___ asphalt mixes for either aggregate type. If the results are dependent on aggre- 
__ gate type, then one or both of the compaction methods may not be adequate 
_ for discriminating between experimental mixes of asphalt. 

The factorial treatment design can be used to evaluate whether the two 
factors act independently on the strength of the test specimens. 


For illustration, 
specimen bonding tensile strength values are shown in Table 6.1 for the four 
treatments as pressure (psi) at test failure. 


mpaction method and aggregate 
type, oe 1 different categories of a fact ctor are lev sels ae the factors. The levels of 
compaction method are Static and Regine Hd the levels of aggregate type are 
Silicious Rock and Basalt. The factors are identified by uppercase letters A and B 
in Table 6.1. The levels of the factors are denoted A,, Ao, ... ; By, Bo, ... ; and 
so forth. The factorial arrangement in Table 6.1 with two factors A and B each with 
two levels, has 2 x 2 = 4 treatment combinations, A; B), A; Bo, Ao B,, and Ap Bo. 


Table 6.1 Tensile strength (psi) of asphalt specimens 


Compaction Method (B) 
Aggregate (B,) (Bo) Aggregate 
Type (A) Static Kneading Means 
Silicious (A)) 64.0 
Basalt (A2) 81.0 
Compaction Means 66.5 78.5 


Source: A. M. Al-Marshed (1981), Compaction effects on asphaltic concrete durability. MS. — 
Civil Engineering, University of Arizona. 
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The factorial treatment design consists of all possible combinations of the lev- 
els of several factors. Experiments with factorial treatment designs often are re- 
ferred to as factorials or factorial experiments. 

The levels of a quantitative factor take metrical values, whereas the levels of a 
qualitative factor are categories of the factor. Both factors in Example 6.1 are quali- 
tative factors; the two levels of each factor are categories. Temperature exemplifies 
a quantitative factor with levels of 10°C, 20°C, and 30°C, for instance. 


6.2 Three Types of Treatment Factor Effects 


pe 
OF Means 
of awe Fachoy 
level bh le 
dhe otter ''s 


held cowstand 


The effect of a factor is a change in the measured response caused by a change in 

the level of that factor. Three effects of interest in a factorial experiment are simple 
effects, main effects, and interaction effects. These effects are illustrated with popu- 
lation means for the factorial treatments in Example 6.1. __ 

The population means for a factorial experiment with two factors, A and B, 
can be represented with cell means j1;;. The term ce// mean is derived from a tabled 
display of means for each of the treatment combinations, illustrated in Table 6.2 for 
a2 x 2 factorial. | 

The means of the treatment combinations, f11, /12, (421, and jZ29, are located in 
the cells of the table—hence, the designation cell means. The means on the mar- 
gins of the table are the averages of the cell means and are referred to as the mar- 
ginal means. The overall or grand mean is the average of the cell means, 


= q(eair + pia + fai + f22). 


Table 6.2 Table of means for a2 x 2 factorial experiment 


B 
A 1 Static 2 Kweced Factor A Means 
1 Silictoug Hi H12 2, = 5 (mi + p12) 
2 Basalt 21 [22 Ply, = 5(a1 + 122) 
Factor B Means is Fh 


= 5(éir + oar) = 5 (u12 + M22) 
Simple Effects Are Contrasts 


The simple effects of a factor are contrasts between levels of one factar at a single 
level of another factor. The simple effect (/,) of aggregate type (A) on tensile 
strength with static compaction (6;) calculated from the cell means in Table 6.1 is 


It measures the difference in tensile strength between basalt and silicious rock 
specimens when the static compaction method is used to form the specimens. The 
average tensile strength of silicious rock specimens was greater than that for basalt 
specimens. Similarly, the simple effect 
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Di; enewee. of 
Owe factors meow 


dwA owother Fachors 
MEOW. 


lo = fog — fiz = 97 — 60 = 37 


measures the difference 1 in tensile strength between basalt ilicious 

mens when the kneading compaction method is used. In this case, ie: average ten- 
sile strength of the basalt specimens was greater than that for the silicious rock 
specimens. 


Main — pre a Effects of a — 


e contrasts be evels_of one factor averaged 
aig The main effect of aggregate type on tensile 
aye is the difference between the marginal means for aggregate type in Table 


Upon close inspection the main effect for aggregate type can be expressed as 
the average of the two simple effects. Thus, from Equation (6.1) 


lz = Plo, — (ua + p22) — (on + }12) 
aes views 
= 5 (M22 — Ha) + 5 (Ha1 — Hat) “oe He a VEN ICS 


l 

= 5 (4 + 5) > averag& of 
gimyle€ 

"5 ae Be Ae af — 3 +37). The main effect contrast, J; = 17, implies the basalt speci- 

mens are stronger than the silicious rock specimens. The simple effect contrast for 


kneading compaction, J, '= 37, supports the same conclusion. However, the simple 
effect contrast for static compaction, |;-=""— 3, suggests the opposite conclusion. 


Interaction Effects Are Differences Between Simple Effects 


hi n et res | he simple effects” of one 
fac actor at different levels of the other factor. Consider the two simple effects of 
aggregate De. on tensile Seabee shown t in Display 6.1. | 


tensile strength c The eee rcuee Geamnen basalt and piicicus wae 
specimens was 40 units greater with kneading compaction than it was with static 
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Display 6.1 Interaction Effect of Compaction Method 
with Aggregate Type 


Compaction Method Simple Effect of Aggregate Type 


Static I, = 65 — 68 =-3 
Kneading lp = 97 — 60 = 37 
Difference l4 = 37 — (-3) = 40 


compaction. T 


The example illustrates the caution that must be exercised in interpreting main 
effects and represents a situation in which interpretations should not be based on 
main effects. The effect of aggregate type on tensile strength of the specimens dif- 
fered considerably between the static and kneading compaction methods. Although 
the main effect measurement suggested the basalt specimens would be stronger, it 
was only true for the kneading compaction method. Se 

The 2 x 2 fa the difference between the 


(6.2) 


The same expression may be derived from the simple effects for B. 

The presence or absence of interaction is illustrated graphically in Figures 6.1 
and 6.2 for a factorial arrangement with two factors, A and B, each at two levels. 
The graphic measure of a simple effect for each factor is illustrated in Figure 6.1. 
The response to A is graphed separately for each level of B. In the absence of inter- 
action the parallel lines show an identical response to A for both levels of B. Under 
these circumstances, the factors act independently, and the main effects can be used 
to interpret the effects of the two factors separately. 

The presence of interaction is illustrated in Figure 6.2. The response lines are 
not parallel when there is an interaction between the two factors. Differences in the 
magnitude of the responses (Figure 6.2a) or in direction of the responses (Figure 
6.2b) represent interaction between the factors. The factors do not act indepen- 
dently, and interpretations should be based on simple effect contrasts. 


Example 6.2 Compaction Effects on Asphaltic Concrete Revisited 


Research Objectives: The variation in tensile strength of asphaltic concrete 
test specimens, as discussed in Example 6.1, was known to be associated with 
the compaction method and aggregate type used to construct the specimens. 
A civil engineer conducted an experiment to evaluate differences among a set 
of compaction methods for their effect on the tensile strength of test speci- 
mens and to determine to what extent the aggregate type affected the com- 
parisons among the compaction methods. 
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Response 


Simple effect 
of Bat A, Simple effect 
of Aat B; 


wes ee ses ces eseeesrscesseascoccnseasece: 


Figure 6.1 [Illustration of no interaction in a factorial arrangement 


Response Response 


Aj A2 
(b) Interaction as a difference 
in direction of response 


(a) Interaction as a difference 
in magnitude of response 


Figure 6.2 Illustration of interaction in a factorial arrangement 


Treatment Design: A factorial arrangement was used with “compaction 


method” and “aggregate type” as factors. There were two levels of aggregate 
type—Ay, (basalt) and Ap» (silicious rock)—and 


Experiment Design: Three replicate specimens of the asphalt concrete were 
constructed and tested for each of the eight treatment combinations. The 24 


specimens were prepared and tested in random order for a completely ran- 
domized design. 


The data for tensile strength measurements (psi) on the 24 specimens are 
shown in Table 6.3. 
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Table 6.3. Tensile strength (psi) of asphaltic concrete specimens for two aggre- 
gate types with each of four compaction methods 


Compaction Method 

Aggregate : Kneading 
Type Static Regular Low Very Low 
Basalt 68 126 93 56 

63 128 101 59 

65 133 98 wei 
Silicious 71 107 63 40 

66 110 60 41 

66 116 59 Ad 


Source: A. M. Al-Marshed (1981), Compaction effects on asphaltic concrete durability. M.S. thesis, 
Civil Engineering, University of Arizona. ‘ 


6.3 The Statistical Model for Two Treatment Factors 


The Cell Means Model 


The observations from a factorial experiment with two factors, A with a Eas cote 
- a b San can be represented with the cell means model. The cell means 


pe 


b factorial with r repl. cations in a completely rand paced Kren 


(6.3) 


wt bone eet, 2,..:,7 


nen’ combination A,p,.and e;4, are random 


Least Squares Estimates of Cell Means 


The cell means can be estimated by the least squares method outlined in Chapter 2. 


* | The sum of squares for experimental error is 
a b r . E, 


ee er ee ae er er el et aac: -- 


The least squares estimators for y;; are the observed cell means of the treatment 
combinations. 
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peftie Mipe 5 


ee ee eee 


argina. ae cul unbiased estimates of the factor marginal 
means, so that 7 Zi, = = = 7. ‘and i sia j= Y,. The overall mean 7 is estimated with the 
observed grand mean, y_. Estimated cell and marginal means for the asphaltic con- 
crete specimens are shown in Table 6.4. 


Table 6.4 Estimated cell and marginal means for tensile strength of asphaltic 
concrete specimens 


Compaction Method 
Aggregate Kneading Aggregate 
Type Static Regular Low  VeryLow Means (Y;, ) 
Basalt 65.3 129.0 973 S23 87.3 
Silicious 67.7 111.0 60.7 41.7 70.3 


Compaction means (7; ) 66.5 120.0 <<< 79.0 49.5 7 = 188 


Additivity and Factor Effects 


The cell means ju;; represent the true response for the treatment combination of lev- 
el a for ee and nat j for B. In the absence of interaction the cell mean ex 
pressed as a sum of a gener an, 1, plus an effect contributed by A, say ai, ., and 
PREECE wae: Bs ay Bi, so that [ij = U+ 04 + Bj. 

The effect for the ith level of factor A can be defined asa; = 7; — fZ.. The 
effect of A is a deviation of the marginal mean from ws tae mean. The Ae for 
the jth level of B can be defined as 6; = 7; — 7. The effects will be fixed effects 
if the levels of the factors are reproducible. In the aGUENCE of interaction, the cell 
mean is the sum of the grand mean and the effects of the factors for that cell: 


fig = B+, — B+ - BD) (6.6) 


The effect of the jth treatment combination, 7;; = (ui; — 7Z_), is the sum of the two 
factor effects: 


(Mig — B=, — B+ 3 — BL) (6.7) 


and the effects of the factors are additive in the absence of interaction. 

In the presence of interaction the treatment effect will not be equal to the sum 
of the individual factor effects as shown in Equation (6.7). An interaction effect, 
denoted as (a/);;, can be defined as the difference between the two sides of 
Equation (6.7), or 
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(6.8) 


The new set of parameters, a;, 3;, and (af);;, can be used to write the linear 
model for the factorial as an effects model 


6.4 The Analysis for Two Factors 


Fundamental Sum of Squares Partition 


cee aad 


The fundamental par 


1 of the total sum of squares can be derived from the 


equation 

(Yizk — 9.) = Giz. — 9. + Yigk — Fig) (6.11) 
The deviation of an observation from the grand mean (Yijk — Y..) is the sum of two 
parts: ” 


For example, using the observations in Table 6.3 and the means in Table 6.4, the 
total deviation for the first observation in Table 6.3 is 


(yi — J...) = 68 — 78.8 = — 10.8. 
The treatment deviation is 
(Wy. — ¥..) = (65.3 — 78.8) = — 13.5 
and the experimental error is 
| (Yi11 — Yi.) = (68 — 65.3) = 2.7 


and 
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— 10.8 = — 13.5 + 2.7 


The latter deviation (y;;, — Y,;, is a measure of experimental error for the ob- 
servation in a properly replicated experiment. If both sides of a (6.11) are 
squared and summed over all observations, the result is 


a b r a b 
ig ee > Wisk -7y= oo > G, -%y (6.12) 
t=1 j=l k=1 i=l j=1 | 


b r 
+) = S_ Wise = Vig)” 


i=1 j=l k=1 


or 


SS Total = S'S Treatment + S'S Error 
Any crossproducts formed by squaring the right-hand side of Equation (6.11) sum 
to zero. P There’ are a total of rab observations, so that S'S Total has (rab — 1) de- 
grees of freedom. With ¢ = ab, SS Treatment has (ab — 1) degrees of freedom and 
the remaining ab(r — 1) degrees of freedom are associated with S'S’ Error. 


Sums of Squares for Factorial Effects | 
The treatment effect (y,;, — y._) in Equation (6.11) can be expressed as the identity 
aU 


Gii.- 0J=G.- 9) + + Gy H.-F + FH) 13) 


where the treatment effect is a sum of three effects: 


e factor A main effect (Gj; — 7) 


| oa 


+ interaction Gj, — H..— Ty + T.) 


For example, the treatment effect for basalt with low kneading in Table 6.4 is 


is — Y ) = 97.3 — 78.8 = 18:5 
The main and interaction effects are 
e Basalt main effect : 
(y, — ¥_ )= 87.3 — 78.8 = 8.5 


e Low kneading main effect: 


(93 — 9 )= 190=— 78:8 = 0:2 
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e Interaction: 
(Yi3.-— U1. — Ys + Y_) = 97.3 — 87.3 — 79.0 + 78.8 = 9.8 


and 


18.5 = 8.5+0.2+4 9.8 


If both sides of Equation (6.13) are squared and summed over all observations 
the left-hand side is the SS Treatment. The treatment sum of squares is partitioned 
into three components represented by the effects on the right-hand side of Equation 
(6. ae ss sl Sool formed by squaring the right-hand side will sum to zero. 

The sum of squares for the first component will be the sum of squares among 
the aiaing means for A 


SSA=rb)-G,,-— 9.) (6.14) 
i=1 
and the second will be the sum of squares among the marginal means for B 
b 
Sthatwa> 0, - 7x (6.15) 
j=l 


The sum of squares for the third component 


is the sum of squares for interaction. That part of the treatment sum of squares is 
not explained by the sum of squares attributed to the two factors as SSA and 
SSB. Consequently, the additive partition of SS Treatment is 


S'S Treatment = SSA + SSB + SS(AB) 


The (ab — 1) degrees of freedom for th the Ne tregenen sum w OE are allocated 


respectively, therefore SS.A and SSB ae (a- ~ =f and oe - iy degrees of free- 
dom. The remaining degrees of freedom allocated to the sum of squares for interac- 
tion are the degrees of freedom for treatments (ab — 1) minus the degrees of 
freedom for the separate factor sums of squares (a—1) and (b— 1), or 
(ab - a — iS — — me — 2 (a — 1)(b — 1). The degrees of freedom for inte 


s of squares in factorials is the product of the degrees of freedom f for ihe 
factors included’ in mine aNMcion 
The complete partition of the total sum of squares for a factorial arrangement 
with two factors is summarized in the analysis of variance shown in Table 6.5. 
The derivation of the sum of squares partitions from solutions for the least 
squares normal equations of the effects model is shown in Appendix 6A. The 
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Table6.5 Analysis of variance for a two-factor treatment design 


i autistic ards sg aa a aa de iirlaiaiatelnapm ane 
Source of Degrees of Sum of Mean Expected 
Variation Freedom Squares Square Mean Square 
Total rab — | SS Total 
Factor A Se SSA MSA o? + rb@2 
Factor B b—1 Sop MS o? + rab? 
AB Interaction (a— 1)(b-— 1) SS(AB) MS(AB) o2 + r62, 
Error ab(r — 1) SSE »~ MSE o? 

a b 
= 2 Fi. hoe) 0; = 2 Fj ee ieee 6, 
gu i 
= =Eb = (Hig — Bi, — By +BY /@-—NYC- ) 
i=1 j= 


Table 6.6 Analysis of variance for tensile strength of asphalt specimens in a 
4 x 2 factorial arrangement 


Source of Degrees of | Sumof Mean 

Variation Freedom Squares Square F Pr 
Total 23 19,274.50 

Compaction (C’) 3 16,243.50 5,414.50 569.95 .000 
Aggregate (A) 1,734.00 1,734.00 182.53 .000 
Interaction (AC) 3 1,145.00 381.67 40.18 .000 
Error 16 152.00 9.50 


a ———— 


analysis of variance for the data from Table 6.3 on the asphalt concrete specimens 
is shown in Table 6.6. 


Tests of iceseeee About Factor Effects 


r _ Significance of interaction is tatennined eyes any deteasions sof 
significance for r main n effects ¢ of the ose 


ence of intera B big — Be - -. +7 = 0 from Equation 
(6.8), a 62, = =e 0 in the eet h mean Peaare TOF AB i interaction The null 
hypothesis for interaction 


versus the alternative 
Ha: (@f)ij = wig — Hi, — Hj +H, FO for some z, 7 


is tested with 
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(6.17) 


versus the alternative 
He: fi; # fl, forsome i,k 


is tested with the ratio 


(6.18) 


BBs -B of eaantity Be the means 
Ho: i, =—. =: =H, 
versus the alternative | 
Hy: ji; # Pim for some j,m 


is tested with the ratio 


(6.19) 


with critical value Fy (»~1),ab(r-1): 


F Tests for Aggregate Type and Compaction Method Effects 


The F test for interaction, 


. The marginal means for aggregate 
type are significantly different since Fy = MSA/MSE = 1734.00/9.50 
= 182.53 exceeds F'o5,1,16 = 4.49. The marginal means for compaction method are 
also different since Fy = MSC /MSE = 5414.50/9.50 = 569.95 exceeds F'o5,3.16 
= 3.24. The significance level for each of the tests is listed as Pr > F = .000 in 
the rightmost column of Table 6.6. 

The significant interaction can modify any inferences based on the significant 
differences among the marginal means of aggregate and compaction. The summary 
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table of cell and marginal means in Table 6.4 will aid in the interpretation of the 
results. 


Standard Errors and Interval Estimates for Means 


ee ee 


d cell means of the factorial ex- 


9.5 
Aggregate: = 0S 
— (3)(4) 
C tio ee (6.20) 
ompac ni. ES ae ee ‘ 
. (3)(2) 
9.5 
Cell means: a = 7S 


t.025,16 = 2.12. The interval estimate oe ac aia fee ie 
Vij. + cass (6.21) 


For example, the 95% confidence interval estimate for the basalt aggregate with 
static compaction mean is 


65.3 + 2.12(1.78) 


Interval estimates are calculated similarly for the other means upon substitution of 
the appropriate mean and standard error estimate from Equation (6.20). 


Multiple Contrasts Assist Interpretations of Significant Interaction Effects 


The significant interaction between aggregate type and compaction method indi- 


" cates the simple effects of ‘one factor differ among levels of another factor. Conse- 


quently, tests of hypotheses about the treatment factors initially should be based on 
simple effect contrasts among the cell means. 

The specific research hypotheses for the study will dictate the contrasts among 
the cell means required to investigate the simple effects. One general research ques- 
tion for this study might ask which of the aggregate types results in the strongest 
specimens for each of the compaction methods. Another hypothesis might address 
the effect of the kneading compaction methods relative to static compaction. 

Contrasts among the cell means in Table 6.4 can be used to test the aggregate 
type simple effects (basalt versus silicious) at each level of compaction to address 
the question of which aggregate type results in the strongest specimens for each 
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compaction method. The four oo hese es SN ou: 


Display 6.2 Estimated Contrasts for Aggregate Simple Effects 


Compaction 95% SCI 
Method Contrast (Basalt—Silicious) 0) 
Static = 9). — Un, = 65.3-67.7 =. -2.4 (—9.5,4.7) 
Regular 9. — Yoo = 129.0—-111.0=18.0. (10.9,25.1) 


Low ros, = 9F.3.-- 60.7. =:3656 (29.5, 43.7) 
Very low = Ya — You = 57.3 — 41.7 = 15.6 (8.5, 22.7) 


pes wit — mini since per een des: 0. The : specimens con- 
aaa from basalt have greater tensile strength than those constructed from the 
silicious rock for the kneading compaction methods since lower limits of the 3 
intervals are greater than 0 and the greatest difference between the aggregate types 
was found with the low kneading compaction. 

The comparisons to consider depend on the nature of the problem and the 
information required from the study. Those comparisons discussed in the pre- 
Rass aaa were made ts to ee the eI of interaction. They ex- 


A comparison among the sareel means for a factor can be iets when 

all of its simple effects are of a similar direction and magnitude. The comparisons 
among marginal means result in a more general inference about the factor, and they 
are more precise with smaller standard errors; see Equation (6.20). In the presence 
of interaction caution must be exercised in making generalizations. The difference 
between the marginal means for basalt and silicious aggregate types, 
c = (87.3 — 70.3) = 17.0, with standard error s, = ./2MSE/rb = ,/2(9.5)/12 
= 1.26, will be significant. The main effect estimate is a difference averaged over 
all compaction methods. The simple effects shown in Display 6.2 for aggregate 
type range from c,; = —2.4 to c3 = 36.6, depending on the compaction method. 
Generalizations about aggregate types based on the main effect estimate would be 
misleading. 
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Residual Analysis to Evaluate Assumptions 


The assumptions for the model regarding homogeneity of variances and nor- 
mal distribution can be evaluated with the residuals as discussed in Chapter 4. The 
residuals for the two-factor factorial are computed as the deviations of the observed 
values from the estimated means for each cell in the arrangement. The residual for 
any cell is €;j. = Yije — Vijk = = Yijk — Yi;, For example, the residual for the first 
observation in Table 6.3 is 


en = Yi — Yu, = 68 — 65.3 = 2.7 


The plots for the square root of absolute residuals versus the estimated values 
and the normal probability plot of the residuals are shown in Figures 6.3a and b. 
The plots show no strong evidence of heterogeneous variance or nonnormality. The 
Levene (Med) test for homogeneity of variance is left as an exercise for the reader 
with reference to Chapter 4 on methods for evaluating assumptions. 


w” 
a) 
a) = 
Gn 
2 
=) 
S N 
2 un 
pe 3 
fe) 
= a 
= ] 
TO 
OF gs , 
bi 
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Fitted values Standard normal quantiles 
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Figure 6.3 Residual plots from the analysis of variance for data on strength in 
Example 6.2: (a) square root of the absolute residuals vs. estimated values and (b) 
normal probability plot of residuals 


6.5 Using Response Curves for Quantitative Treatment 
Factors 


een 5 


JO. tray! respon — 
treatment factor throweWoult _ sitiniverieee Fane" oft the fa 
study. The evaluation of response trend curves is aan to the ee 
experiment in this section. The analysis is discussed initially for experiments with 
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one quantitative factor and one qualitative factor. Subsequently, responses will be 
analyzed for experiments with two quantitative factors. 


One Quantitative Factor and One Qualitative Factor 


Ascertaining whether there is interaction between tbe a and qualitative 
factor is the first objective in the analysis. The response to the quantitatiy : 
will be different across levels of the qualitative factor > whes? the t two eae interact. 
In that case, the response curves for the quantitative factor can be estimated sepa- 
rately for each level of the qualitative factor. In the absence of interaction, the 
response trend to the quantitative factor will be similar at all levels of the quali- 
tative factor, and a single response curve will suffice for description of the process 
with respect to the quantitative factor. 


Example 6.3 Heavy Metals in Sewage Sludge 


Sludge is a dried product remaining from processed sewage; it contains nutri- 
ents beneficial to plant growth. It can be used for fertilizer on agricultural 
crops provided it does not contain toxic levels of certain elements such as 
heavy metals. Typically, the levels of metals in sludge are assayed by grow- 
ing plants in media containing different doses of the sludge. 


Research Hypothesis: A soil scientist hypothesized the concentration of cer- 
tain heavy metals in sludge would differ among the metropolitan areas from 
which the sludge was obtained. The variation could be a result of any number 
of causes, including different industrial bases surrounding the areas. If this 
were true, then recommendations for applications on crops would have to be. 
preceded by knowledge about the source of the material. An assay was 
planned to determine whether there was significant variation in heavy metal 
concentrations among diverse metropolitan areas. 


Treatment Design: The investigator obtained sewage sludge from treatment 
plants located in three different metropolitan areas. Barley plants were grown 
in a sand medium to which the sludge was added as a fertilizer. 


Experiment Design: Each of the nine treatment combinations was assigned 
to four replicate containers in a completely randomized design. The 
containers were arranged completely at random in a growth chamber. At a 
certain stage of growth the zinc content in parts per million was determined 
for the barley plants grown in each of the containers. The data are shown in 
Table 6.7, and the analysis of variance is shown in Table 6.8. The manual 
calculations for linear and quadratic sums of squares partitions are shown in 
Table 6.9. 
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Table 6.7 Zinc content (ppm) of barley plants grown in media containing sludge 
at three rates from three metropolitan areas 


City and Rate(MT/hectare) 


A B = 
0.5 1.0 LD 0.5 1.0 Lk 0.5 1.0 iD 
20.4. 2ac25 20.0 bi ire Gere oe eet ee Pe gi Fee 
23.3.2: 39:2, 2-446 Jd ncdevies 1h. 1 WS. 215-98 
Dot: «25S obo 30.8: 253.3, 68.4 is 6 23.2 Te 
22:9;<) 319<2 33.6 sp ReDOF Tit | Po 19.9 - 259 


Source: J. Budzynski, Department of Soil and Water Science, University of Arizona. 


Table 6.8 Analysis of variance for zinc content in barley plants grown in media 
containing sewage sludge at three different rates from three metropolitan areas 


Source of Degrees of | Sum of Mean 

Variation Freedom Squares Square F ‘ga 

Total 35 9993.38 

Rate (R) 2 1945.45 O72 42 ae .000 
Rate linear | 1944.00 1944.00 101.35 .000 
Rate quadratic l 1.45 1.45 0.08 .786 

City (C) 2 5720.67 2860.34 149.13 .000 

Rate x City (RC) 4 1809.40 452.35 23.58 .000 
Rate linear x City (RC) 2 1760.15 880.07 45.88 .000 
Rate quadratic x City 2 49.25 24.63 1.28 .293 

Error 23 517.86 19.18 


degrees of geedom for Rate ai of 5c squares © partition 2 into v4 = of feed each 


. ht for the linear and quadratic rates. The F’ test indicates significance for the linear 
ma ‘regression — partition (Fo = thes). and nonsignificance (Fo = 0.08) for the qua- 
 dratic partition for Rate. ae: 


The 2 degrees of freedom for bad of the Rate by City interaction sums of 
squares indicate the variability among cities in linear and quadratic regression 
coefficients for Rate. The interaction between Rate linear regression and City is 
significant (Fy = 45.88), but the interaction between Rate quadratic regression and 
City is not significant (fo = 1.28). 


Interpret Factor Effects with Regression Contrasts 


The significant interactions between city and linear regression on rate of sludge 
application suggests that interpretations should be based on separate regression 
lines for each city. The estimated linear regression lines for each city are plotted in 
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Table 6.9 Calculation of linear and quadratic contrast sums of squares partitions 
for rate and rate x city interaction 


Rate (metric tons/hectare) Linear Quadratic 
City 0.5 1.0 1.5 yy Pigdes > PoiVis, 
A 24.55 30.45 36.18 11.63 — 0.17 
B 31.18 48.20 72.60 41.42 7.38 
3 19.10 21:90  <20.05 0.95 — 4.65 
Means (7 ; ) 24.94 33.52 ~42.94 18.00 0.84 
Linear (P,;) —1 0 l 
Quadratic (P»;) | —2 l 
SS[Rlinear] = ralX Pry; ]?/X Pz, = 12[18]?/2 = 1944 
SS|[R quadratic = ald Poy ;]?/% Py, = 12[0.84]°/6 = 1.41 
SS{R linear x-C| = rae Pyj¥ji;,)?/& Pz, — SS[R linear] 
2 2 2 
_— AL11.63? + 41.42? + 0.957] See 
(2) 
= Te 
Seihquadx Cc) = roe Poj¥ji;,)?/U Ps; — SS|R quad] 
2 292 2 
: 4[( — 0.17)° + 7.38" + ( — 4.65)°| = Bae cdl 
(6) 
==. 89 


Figure 6.4 along with the cell means. The plot illustrates the Rate(linear) by City 
interaction. The response to rate is linear for each city. The significance of the 
interaction between city and the linear partition for rate shows up in the plot as a 
different linear response of zinc to rate for each city. 

The linear regression contrasts for rate are simple effects for rate computed for 
each of the cities. The linear regression lines can be computed for each city from 
the estimated effects in Table 6.9 or with a standard regression computer program. 
The regression line can be computed following the procedure in Section 3.3 using 
the cell means in Table 6.9. The linear orthogonal polynomial coefficient estimate 
for city A is 


ay += Yo PyiYyj/% i, 
= 1(24.55) + 0(30.45) + 136.18) 11.63 _ 


i= ye + 2 - 


The mean for city A is 9, = 30.39. With \y = 1, R = 1.0, and d = 0.5 (Display 
3.5), the transformation to an equation in terms of rate (2) is 
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Zinc (ppm) 


06°) “Os 1.0 1.2 1.4 
Rate (mt/ha) ; 


Figure 6.4 Mean zinc content versus rate of sludge application for three cities 


Y=Y,, tayP; 
(R — 1.0) 


= 30.39 + — = 18.79 + 11.6(R8) 


The linear contrast for city C (0.95 in Table 6.9) is considerably smaller than 
that for the other two cities, 11.63 and 41.42 for cities A and B, respectively. The 
standard error for the rate linear contrast on the basis of cell means in Table 6.9 for 
any particular city is 


[((MSE)[(—1)?+02+ 12] —(/(19.18)|2] 
Sc = Do ee ee = erg = 3.1 


The 95% simultaneous confidence intervals for the three linear contrasts 
require the Bonferroni t.o5,3.07 = 2.55. The 95% SCI for cities A, B, and C are, 
respectively, (3.73, 19.54), (33.52, 49.33), and ( — 6.96, 8.86). The linear responses 
for cities A and B are significantly positive, with city B having the largest positive 
linear contrast. The interval for city C includes 0, and we can conclude that zinc 
will accumulate in barley crops fertilized with increasing amounts of the sludge by- 
product to the greatest extent from city B and to a lesser extent from that of city A, 
but that there will be no significant accumulation from that of city C. 


Two Quantitative Factors 


The response to two quantitative factors can be represented by a polynomial equa- 
tion with two independent variables. The degree of the polynomial will depend on 
the number of levels for each of the factors. First-degree equations can represent a 
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factor with two levels, second-degree equations for three levels, and so forth. The 
geometric representation of a polynomial equation with two independent variables 
is a response surface in three dimensions. For example, suppose the levels of factor 
A and factor B are represented by two metrical variables x; and x2 in a quadratic 
polynomial. The second-degree polynomial equation 


y = Bo + Bit, + Box? + 3x9 + Bars + Bsx1 22 (6.22) 


is an empirical function commonly used for the approximation of a response sur- 
face in experimental studies. The quadratic surface can be explored for factor levels 
that result in the optimum response or different combinations of factor levels with 
equivalent responses. 

The analysis of the factorial experiment with two quantitative factors consists 
of orthogonal polynomial partitions for the factor main effect and interaction sums 
of squares. The nature of the polynomial response function can be determined from 
these partitions. A graph of the responses can be used as an aid to interpret the role 
that each of the factors plays in the response. 


Example 6.4 Water Uptake by Barley Plants 


Deposited salts accumulate in soils irrigated for agronomic and horticultural 
crops. The increased soil salinity evomually suppresses plant development 
and crop yields. 


Research Hypothesis: An investigator hypothesized that exposure of plants 
to high levels of salts in their media over time eventually inhibits the plant's 
uptake of water and nutrients from the soil, thus suppressing the growth and 
development of plants. An experiment was conducted with barley plants to 
measure the effect of growth medium salinity on water uptake by the plants. 


Treatment Design: A factorial arrangement was used with “salinity of 
media” and “age of plant” in days as the two factors. The plants were grown 
in nutrient solutions with the salinity level adjusted to three different levels. 
The salinity levels expressed as units of osmotic pressure were 0, 6, and 12 
bars. Plants were harvested at 14, 21, and 28 days. 


Experiment Design: Each of the nine treatment combinations of salinity and 
days was assigned to two replicate containers in a completely randomized de- 
sign. The containers were placed in the growth chamber in a completely ran- 
domized arrangement. 

One of the measurements made at harvest was the amount of water uptake 
by the plants during the experiment. Water uptake is expressed as milliliters 
of water uptake per 100 grams of plant dry weight. The data are shown in 
Table 6.10, and the analysis of variance is shown in Table 6.11. Manual cal- 
culations for the sums of squares partitions are illustrated in Table 6.12. 


Computational Notes: The arrangement in Table 6.12 is convenient for manual 
computation of the sums of squares partitions from the cell means. Main effect and 
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Table 6.10 Water uptake (m1/100 g) by barley plants at 14, 21, and 28 days 
grown in solutions with salinity levels of 0, 6, and 12 bars 


Salinity 0 bars 6 bars 12 bars 

Days 14 21 28 14 21 28 14 21 28 
a2 BO) ae aa OD SA. ge oO aS re 
a3 St 4. AS a A ee SS 

WICMIS TY, 212 oe tee to OD 1026 re 3.20. 7.95 


Source: Dr. T. C. Tucker, Department of Soil and Water Science, University of Arizona. 


Table 6.11. Analysis of variance for water uptake by barley plants 


Source of Degrees of Sum of Mean 
Variation Freedom Squares Square F eee 
Total 17 184.73 
Salinity (S) 2 9.51 4.75 8.52 008 
S linear l 74h 72) 12.92 .006 
S quadratic ] 2.30 2.50 4.12 073 
Days (D) 2 DE i gee 136.24 .000 
D linear l 147.00 147.00 fe .000 
D quadratic l 4.99 4.99 8.94 015 
Salinity x Days(SD) 4 18.21 4.55 8.16 005 
S lin x D lin I 1332 13.52 24.24 001 
S lin x D quad l 2.94 2.94 ae 047 
S quad x D lin l ret 12a 2.18 .174 
S quad x D quad l 0.53 0.53 0.96 os 
Error 9 5.02 0.56 


interaction partitions can be computed in the same table. Main effect partitions are 
normally computed from marginal means. However, cell means are used in Table 
6.12 to compute the main effect partitions: thus, repeated values of polynomial con- 
trast coefficients for main effect partitions are necessary for the cell means that con- 
tribute to each of their respective marginal means. For example, 5S), the linear 
contrast for salinity, requires a — | for each cell at the 0-bar level, a 0 for each cell 
at the 6-bar level, and a+ 1 for each cell at the 12-bar level. 

The coefficients for the interaction partitions are determined as the product of 
the coefficients for the corresponding components of the interaction. For example, 
the coefficients for the interaction between salinity(linear) and day(linear) in Table 
6.12 are formed as the products of the coefficients for the linear contrast of the 
main effects for the two factors. Each coefficient for S;D) is a product of the 
corresponding coefficients for S; and D;. The computation is exhibited in Display 
6.3. 
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Table 6.12 Calculation of linear and quadratic sums of squares partitions for 
salinity and day main effect and interaction sums of squares 


Bars Days Means (¥;;.) S| Bo D, Dd, SD; is § Pee Se 2 = 

oi ple —] 1 —1 I l ~1 =| 1 
21 5.35 —1 l 0 —2 0 ie: 0 —2 
28 12.80 aA 1 I = 1 1 1 
6. 4s 4.10 0 —2 —1 0 0 2 —2 
21 6:55 0 —2 0 —2 0 0 0 4 
28 10.20 0 2 1 I 0 0 —2 —2 
i 4 3.10 I 1 —| I —| l — | l 
21 5.20 1 l 0 = 0 <3 0 —2 
28 7.95 1 1 1 I 1 1 1 

Uw Peiitiz  —4.7 —-46 21.0 6.7 — 5.2 —42 27 3.1 

y P2, 6.0 18.0 60 - 13.0 4.0 i 10. Aee 

Ss* 72 a 5.0 Be: 2.9 1.2 0.5 

Effect’ —0.78 — 0.25 $0 a7 — 1.30 — 0.35 0.23 0.09 


*SS = ro: Pay, °/D Ps, ; Bpect = S Pay; [ P? 


ci? aj 


Display 6.3 Computation of Coefficients for Orthogonal 


Polynomial Interaction Contrasts 


—-i -1!1 


Interpretations for the Regression Contrasts 


The Fo statistics in Table 6.11 indicate significant interaction of the salinity(linear) 
partition with the day(linear) and day(quadratic) partitions. Main effect partitions 
for salinity(linear), day(linear), and day(quadratic) were also significant. None of 
the salinity quadratic effects were significant at the .05 level of significance. The 
response of water uptake to salinity level was the primary focus of the investiga- 
tion. A profile plot facilitates the interpretation of the results with significant inter- 
action. A plot of the cell means and the linear regression of water uptake on salinity 
for each day is shown in Figure 6.5. 

The linear regression lines for water uptake on salinity computed separately for 
each day are shown in Figure 6.5 along with the estimated treatment means. The 
salinity linear contrasts for each day computed from the S; column in Table 6.12 
are shown in Display 6.4 along with their 95% simultaneous confidence intervals 
using the Bonferroni t 95,39 = 2.93. The 95% SCI indicate salinity had no effect on 
the water uptake by the plants for the first three weeks up to day 21 since the inter- 
vals for 14 and 21 days include 0. However, by the end of the fourth week, day 28, 
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Water uptake (ml/100 g) 


0 6 42 
Salinity (bars) 


Figure 6.5 Water uptake of barley plants at three salinity levels for 14, 21, and 28 
days 


Display 6.4 Linear Contrasts for Salinity for Each Day 


95% SCI 
Day Linear Contrast (L,U) 


14 Si = — 1(2.75) + 0(4.10) + 103.10) = 0.35 (=155,255) 
i Se —wli(S. 98) 5 ee A Sa 5S (= 83505) 
28 Ss — 112-80) + 000.20).4 117.95). = 485 6 705,- 20) 


Standard error MSE|(— 1)? + 0? + 12]/r = ./0.56 = 0.75 


water uptake by the plants decreased with an increase in the salinity of the medium 
since the upper limit of the confidence interval for the linear contrast was — 2.65. 

The differences among the linear responses to salinity resulted in significant 
interactions between the salinity(linear) partition and the day partitions. The signi- 
ficant interaction between salinity(linear) and day(quadratic) effects indicates that 
the linear response to salinity changes in a quadratic manner over days. 

From day 14 to day 28 the S; contrast values decrease in a quadratic trend. The 
decrease in the contrast value from day 14 to day 21 is (0.15) — 0.35 = -0.50, 
whereas the decrease in the contrast from day 21 to day 28 is (-4.85) — (0.15) 
= — 4,70. 
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Computing a Response Surface Equation 


The polynomial equation relating water uptake to salinity and days will include the 
terms judged significant in the analysis of variance tests. The significant terms were 
Si, Di, Dg, SiDi, and S;D,. The equation in terms of S' and D will be 


Y = Bot PS + BoD + 3D° + B4SD + Bs D° (6.23) 


The coefficients for Equation (6.23) may be estimated directly from the data with a 
multiple regression program, or they may be evaluated from the orthogonal poly- 
nomial equation as shown in Section 3.3. The polynomial coefficient contrasts in 
Display 3.5 are used for the transformations. Let P.,; and P.q represent the poly- 
nomial contrasts for salinity and day, respectively. The orthogonal polynomial equa- 
tion can be written as 


y=pet+ayPys +P + y2Poq + (aY)uPisPid + (@Y)12PisPea (6.24) 


where a is the linear polynomial coefficient for salinity, +, is the linear coefficient 
for day, and (a7); is the interaction coefficient for salinity linear by day quadratic. 
The estimates of the coefficients in Equation (6.24) are calculated in Table 6.12 as 
Effect = & Pois¥j; [3 Pe. For example, the estimate for (ay); from the S;D, line 
in Table 6.12 is —5.2/4 =—1.3. The term (ay): P:;Pia in Equation (6.24) with 
A, = 1 becomes 


ree (5=*) (PR?) =e S1OLS 6) 1) — 21) 


The remaining estimates are computed in the same manner with w= yy = 6.44, 
A, = 1, and Ay = 3, and the resulting equation is 


yj = 5.70 — 0.0133( — 6) + 0.50(D — 21) +.0.0227(D — 21) 
—0.0310(S — 6)(D — 21) — 0.0036(S — 6)(D — 21) 


These equations can be used to explore response surfaces for maxima or mini- 
ma, or they may be used to determine values of the factors that result in equivalent 
responses. Specialized tools for these methods are discussed later in Chapter 13, 
Response Surface Designs. 


6.6 Three Treatment Factors 


The inclusion of additional factors in the treatment design increases the complexity 
of interaction patterns among the treatment factors. The number of treatment com- 
binations increases rapidly as factors are added to the design. The three-factor de- 
sign with a levels of A, b levels of B, and c levels of C’ has abc treatment 
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combinations. A fourth factor, D, with d levels further increases the number of 
treatments by a multiple of d. 

The two-factor design enables the investigation of the first-order, or two- 
factor, interaction AB. The two additional first-order interactions, AC and BC, in 
the three-factor design broaden the inferences from the study. In addition, there is a 
second-order, or three-factor, interaction, ABC’, to consider. Third-order, or four- 
factor, interactions such as ABCD enter the increasingly complex interaction infer- 
ence structure as factors are added to the design. 


Example 6.5 Shrimp Culture in Aquaria 


The California brown shrimp spawn at sea and the hatched eggs undergo lar- 
val transformations while being transported toward the shore. By the time 
they transform to post-larval stage they enter estuaries, where they grow 
rapidly into subadults and migrate back offshore as they approach sexual 
maturity. 

The shrimp encounter wide temperature and salinity variation in their life 
cycle as a result of their migrations during the cycle. Thus, a knowledge of 
how temperature and salinity affect their growth and survival is of great 
importance to understanding their life history and ecology. 

There was at the time of this experiment great interest in commercial cul- 
ture of the shrimp. From the standpoint of mariculture another important fac- 
tor was stocking density in the culture tanks that affects intraspecific 
competition. 


Research Objective: The investigators wanted to know how water tempera- 
ture, water salinity, and density of shrimp populations influenced the growth 
rate of shrimp raised in aquaria and whether the factors acted independently 
on the shrimp populations. 


Treatment Design: A factorial arrangement was used with three factors: 
“temperature” (25°C, 35°C); “salinity” of the water (10%, 25%, 40%); and 
“density” of shrimp in the aquarium (80 shrimp/40 liters, 160 shrimp/40 
liters). The levels chosen were those considered most likely to exhibit an ef- 
fect if the factor was influential on shrimp growth. 


Experiment Design: The experiment design consisted of three replicate 
aquaria for each of the 12 treatment combinations of the 2 x 2 x 3 factorial. 
Each of the 12 treatment combinations was randomly assigned to three aquar- 
ia for a completely randomized design. The 36 aquaria were stocked with 
post-larval shrimp at the beginning of the test. The weight gain of the shrimp 
in four weeks for each of the 36 aquaria is shown in Table 6.13 on a per- 
shrimp basis. 
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Table 6.13. Four-week weight gain of shrimp cultured in aquaria at different lev- 
els of temperature (7), density of shrimp populations (D), and water salinity (S’) 


ye D S Weight Gain (mg) 
25°C 80 10% SG, de, {3 
25% 544, 371, 482 
40% 390,290, 397 
160 10% 53, 73, 86 
25% 393, 398, 208 
40% 249, 265, 243 
a5 80 10% 439, 436, 349 
25% 249, 245, 330 
40% 247, 277, 205 
160 10% 324, 305, 364 
25% 352, 267, 316 
40% 188, 223, 281 


scsi cc ceil bc tc iinreee'ecdesidlensich bigest teacup 
Source: Dr. J. Hendrickson and K. Dorsey, Department of Ecology and Evolutionary Biology, 
University of Arizona. 


The Statistical Model for Three Factors 


The cell means model for a three-factor experiment with r replications of each of 
the abc treatment combinations in a completely randomized design is 


Yijkl = Higk 1 Cijkl (6.25) 


Be ge 2 ers Bee i a ey 
The cell mean ju; expressed as a function of the factorial main effects and interac- 
tions is 
Migk = + a4 + By + Ye + (aB)ij + CoVix + CVn + (@PV)izn (0.26) 
where pp = 7i__ is the general mean and a, (;, and +, are the main effects of A, B, 
and C’. The respective two-factor interaction effects are (a);;, (ay)ix, and (GY)jK; 
and the three-factor interaction effect is (a/);;,. The definitions of main effects 
and two-factor interactions follow from the derivations given in Equations (6.6) to 
(6.8) for the two-factor experiment. The main effects are 
a; = (0; — H.), By = (hy. — HD, Ve = Gk — B..) 
and a typical two-factor interaction is 
(BY) 5k = Bix — 7...) — Bi — Ye (6.27) 
She, Pa 
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The three-factor interaction occurs when the main effects and two-factor inter- 
actions do not satisfactorily explain the variation in the cell mean deviations 
(His: — H.). The three-factor interaction is the difference between the cell mean 
deviation and the sum of the main effects and two-factor interaction effects: 


(ab )ijk = (ize — B.,.) — [oa + Bj + e+ (aB)ij + (aN) ik + (BN 4] (6.28) 
= igh ~ Big =~ Mea a TB ep + 8 


The Analysis for Three Factors 


The sum of squares for treatments is partitioned into main effect and interaction 
sums of squares as 


S'S Treatment = SSA+SSB+SSC + SS(AB) 
+ SS(AC)+ SS(BC)+ SS(ABC) (6.29) 


Keep in mind that the degrees of freedom for main effect sums of squares are 
(a — 1), (6 — 1), and (c — 1), respectively, for factors A, B, and C. The degrees of 
freedom for two-factor interactions are the product of the main effect degrees of 
freedom for the included factors. Likewise, the degrees of freedom for a three- 
factor or higher interaction are the product of the main effect degrees of freedom 
for the included factors, so that S:S( ABC) has (a — 1)(b— 1)(c — 1) degrees of 
freedom. 

The sums of squares partitions and analysis of variance table for the three- 
factor shrimp growth experiment are shown in Table 6.14. The Mean Square for 
Error in Table 6.14 is the denominator of the Fo statistic to test the null hypothesis 
for any set of factorial effects with the fixed effects model. The Fp statistics in 
Table 6.14 lead to the rejection of the null hypothesis for the T'S two-factor interac- 
tion, the 7DS three-factor interaction, and all main effects. The cell and marginal 
means for all factors are shown in Table 6.15. 


Table 6.14 Analysis of variance for weight gain of shrimp cultured in aquaria 


Source of Degreesof  Sumof Mean 

Variation Freedom Squares Square F Pr 
Total 35 537,32701 : 
Temp (7) l 15537600215 ,376:00 5.30 030 
Salinity (S) 2 96,762.50 48,381.25 16.66 .000 
Density (D) ] 2118.78: .. 21,218.78 tak 012 
TS 2 300,855.17 150,427.58 51.80 .000 
TD l 8,711.11 8,711.11 3.00 .096 
SD 2 674.39 337.19 Giz 891 
TDS = 24,038.39 12,019.19 4.14 029 


Error 24 69,690.67 2,903.78 
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Table 6.15 Cell and marginal means of four-week weight gain of shrimp cultured 
in aquaria at different levels of temperature (7’), density of shrimp populations (D), 
and water salinity (S) 


Cell Means (Y;;x,.) 
Density 
80 160 

Temperature : Temperature 
Salinity ~“25°= ~35° i 2 UV ¢ 
10% 70 408 fs oe 220 
25% 466 275 ao he 346 
40% 359 = 243 gee aga 271 

Ix Diteorns “298 309 219i eal 

3.2 
T x S Means (Y; ;, ) D x S Means Fi) 


s 
10% 25% 40% J, 
239 370 301 303 
my 99 55 


10% 25% 40% J, es. 
80 
160 


Interpretations must be conditioned on some measure of statistical significance 
in conjunction with the biological significance of the responses. Standard errors of 
cell and marginal means are required for any subsequent statistical tests of specific 
comparisons. The standard error for any mean is sy = \/ MSE//n, where n is the 
number of observations in the mean. The standard error of the difference between 
any pair of means is s(y; — 9;) = /2MSE /n. The estimated standard errors for 
the shrimp culture experiment are shown in Table 6.16. 


Some Preliminary Interpretations About the Factor Effects 


The significance of the three-factor interaction indicates that temperature, salinity, 
and density are interrelated in their effect on the shrimp growth. The significant 
three-factor interaction implies that the interaction between two factors is not con- 
stant over levels of the third factor. Consider the interaction between density and 
salinity separately at temperatures of 25°C and 35°C, as shown in the graphs of cell 
means in Figure 6.6. 

A comparison of the simple effects of salinity at each level of density and 
temperature can be used to interpret the results. The simple effects of salinity are 
best estimated as orthogonal polynomial linear and quadratic contrasts at each com- 
bination of density and temperature. Sums of squares partitions computed for the 
three factor interaction 7JDS are SS(T x Dx Slinear) = 11,051 and 
SS(T x D x S quadratic) = 12,987 with P-values .063 and .045, respectively, 
indicating the salinity quadratic coefficient is dependent on the levels of tempera- 
- ture and density. 
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Table 6.16 Standard errors for cell and marginal means in a three-factor 
treatment design 


Temperature: a =2 levels; Density: b = 2 levels; Salinity: c = 3 levels 


Main Factor Means 


Temperature Salinity Density 
IMSE [2903.78 IMSE [2903.78 IMSE [2903.78 
fic 18 rab 12 rac 18 
= 42.7 a 13.6 ole et | 


Two Factor Marginal Means 


Density by Temperature Density by Salinity 
MSE 2903.78 MSE 2903.78 
oe 9 coe 6 
=.13.0 a 220 
Salinity by Temperature Cell Means 


MSE 2903.78 2903.78 
i 6 3 
is 220 


25°C 35°C 


£ e 
= < 
O oe 
S S 
N N 
@ 80 Density @ 80 Density 
oO io] 
= © 160 Density = © 160 Density 
oO Oo 
Salinity Salinity 


Figure6.6 Weight gain of shrimp cultured in aquaria in a 2 x 2 x 3 factorial 
arrangement of temperature, density, and salinity 


The salinity quadratic coefficients were computed as orthogonal polynomial 
contrasts for the four combinations of temperature and density from the cell means 
in Table 6.15 following a template similar to that provided in Table 6.12. For 
example, the quadratic coefficient for salinity at 25°C and a density of 80 is 
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[70 — 2(466) + 359] 503 99:8 
wan, tr ee eae: 
with a standard error ,/2903.78/6(3) = 12.7. The 95% SCI estimates were com- 
puted for the four coefficients based on the Bonferroni ¢95 4.24 = 2.70. 
The 95% SCI estimates of the quadratic coefficients for salinity at 25°C are 
(— 118.1, — 49.5) for a density of 80 and (— 91, — 22.9) for a density of 160, 
while the estimates at 35°C are (— 17.5, 51.1) for a density of 80 and ( — 44.6, 
24.0) for a density of 160. Clearly the quadrature at 25°C is significant since the 
95% SCI do not include 0 and not significant at 35°C since those intervals include 
0. 


6.7 Estimation of Error Variance with One Replication 


Situations arise in research studies wherein only one observation is available in 
each cell of a factorial arrangement. The experimental error variance cannot be esti- 
mated with only one replication of the treatment combinations. The sums of 
squares partitions for factor main effects and interaction are equal to the total sum 
of squares for the observations. 

_ Additivity describes the case when there is no interaction between factors. 
Under additivity of factors the mean square partition for interaction can be used as 
an estimate of experimental error. The additivity of main effects or absence of 
interaction is not guaranteed, and some means of evaluating the presence of interac- 
tion is required. 


Error Variance Estimates with Two Quantitative Factors 


The additivity of quantitative factors can be investigated with the interaction com- 
ponents for linear and possibly quadratic regression partitions (Section 6.5). For 
example, the sum of squares for linear x linear interaction can be partitioned out of 
the interaction sum of squares with the assumption that the remaining sum of 
squares for deviations from linear x linear interaction is experimental error. These 
sums of squares for deviations from the linear x linear interaction would include all 
higher orders of polynomial interaction such as linear x quadratic, and so forth. 
The mean square for deviations from the linear x linear interaction can be used as 
the mean square for error. The number of | degree of freedom interaction terms 
that are partitioned from the interaction is a matter of judgment based on the num- 
ber of degrees of freedom available for a reasonably powerful test for interaction 
and main effects. 


Error Variance Estimates with a Qualitative and a Quantitative Factor 


The same approach can be used if one of the factors is qualitative and the other is 
quantitative. In this case, the sum of squares for the interaction between the qualita- 
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tive factor and the linear effect of the quantitative factor can be partitioned out of 
the interaction sum of squares (Section 6.5). The remaining deviations’ sum of 
Squares can be used to estimate experimental error. 


Error Variance Estimates with Two Qualitative Factors 


If both factors are qualitative the problem is somewhat more difficult. Tukey 
(1949b) gave a method for isolating a 1 degree of freedom sum of squares to test 
for nonadditivity in a two-way classification with one observation per cell. The 
term for nonadditivity in the linear model is a simple product of the main effects, 
Aa;{;, where the parameter \ represents the added parameter for nonadditivity. 
The product of main effects is a multiplicative form of interaction, and if there is 
nonadditivity from this specific type of interaction between the main effects, a; and 
(;, then A # 0. Under this model the cell means are a sum of the general mean, the 
factor effects, and the product term, or 


Mig = B+; - 2 )+ Gj -B) + AG; - Bh); — 2.) 


The sum of squares for nonadditivity requires a computation involving the 
deviations of the A and B means from the grand mean, (y; — y_) and (ree @ 
respectively. The technique is illustrated with Example 6.6. 


Example 6.6 Hearing Levels in Adult Males 


The data in Table 6.17 are the percentage of men aged 55 to 64 with hearing 
levels 16 decibels above the audio metric zero. The row categories were 
sound levels in cycles per second (hertz), and the column categories were 
seven occupational categories. 


The required computations include 


P;= SG, —F_yis (6.30) 
=i 
b 
P=)" PG; —¥_.) = 17,416 (6.31) 
j=1 
and 
a b 
>> G:.-9.¥ =6,944, SY @;-7) = 162 (6.32) 
i=1 j=l 


The 1 degree of freedom sum of squares for nonadditivity is 
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Table 6.17 Percentage of men with hearing levels 16 decibels above the audio 
metric zero classified in a 7 x 7 factorial arrangement with one observation per cell 


LL 


B 
A pea ee eee se 4, OG. -B) 
] ye 6.8 8.4 i -14.6 79 48 6.6 — 31.6 
2 7 8.1 8.4 La 2 Ee 4.5 Ss] — 32.5 
3 144 148 oe 3 eS OO 1a 273 — 10.9 
4 S74. - 62.4 S74 633 635. 656 - 59.8. 5838 20.6 
5 66.2 .#8t7 a4 3 807. 70. 808. 824... 150 36.8 
6 75.2 94.0 743. $79" 93.3 °81.8 ~8003." "$4.7 46.5 
7 a 107 10.7 55 3281-174 6.1 9.4 — 28.8 


Boe te ed | Oe ed Bog. i. ae 7 = 382 
at 1 88 15 .=64 . 05 fees cae eae | 
FP, Gi19,. 7730 . 5046. - 7776,..6890. 71313. J192 


a a ec NS 
Source: C. Daniel (1978), Patterns in residuals in the two-way layout. Technometrics 20, 385-395. 

Data originally published in J. Roberts and J. Cohrssen (1968), Hearing Levels of Adults, Table 4, p. 

36. US. National Center for Health Statistics Publications, Series 11, No. 31. Rockland, Md. 


P? 

SQcnaddigvity) = —— 

d geek d, 7) 
= j= 

17,416? e 

~ (6,944)(162) 


(6.33) 


269.6 


The analysis of variance for the data is shown in Table 6.18 with the sum of 
squares for error partitioned into a 1 degree of freedom sum of squares for non- 
additivity and a residual sum of squares. 


Table 6.18 One degree of freedom partition for nonadditivity in the analysis of 
variance for a 7 x 7 factorial with one observation per cell 


Source of Degrees of Sum of Mean 
Variation Freedom Squares Square 
Rows 6 48,589.1 8,098.2 
Columns 6 17413 190.2 
Error 36 1,444.7 40.1 
Nonadditivity l 269.6 269.6 


Residual 3D Pe gin te 33.6 


eer tern 
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The null hypothesis of no nonadditivity is tested with the statistic Fo 
= MS(Nonadditivity)/M S(Residual) = 269.6/33.6 = 8.02. The null hypothesis 
is rejected with a critical region Fo > F'o5,.1.35 = 4.12. 

Several methods have been developed to ascertain the source of nonadditivity 
in a two-way table. Daniel (1978) used a method based on the residuals in each of 
the cells, y;; — Y;, — ,; + y,,. Technical discussions and examples of several other 
methods and models for nonadditivity can be found in Johnson and Graybill 
(1972), Bradu and Gabriel (1978), and Mandel (1971). 


6.8 How Many Replications to Test Factor Effects? 


Procedures were given in Section 2.14 to estimate replication numbers based on the 
test for differences among treatment means with the Fo statistic. The values of ® 
(Equation (2.25)) can be applied directly to a test for differences among cell means 
in the factorial arrangement with the null hypothesis Ho: 44, = p42 = «++ = Map. In 
this case, the factorial structure is ignored and the cell means model 4; ;x 
= piz + ej; expressed in the effects model form is y;;, = pp + 7%; + eij~, where 7;; 
is the effect of the 27th treatment combination in the factorial arrangement. Then 


(6.34) 


is used to estimate replication numbers from the charts based on the values of 7;; 
required to be significant. 

If replication numbers based on the factorial effects are required, the non- 
centrality parameters are 


5 ay, : = 
= aes 2? Ayman) if and Mo =r) ) (6.35) 
j=1 


respectively, for A and B main effects and AB interaction. Then ® is determined 
as ® = ,/\/(v, + 1), where 1; are the numerator degrees of freedom for the Fo 
statistic. 


6.9 Unequal Replication of Treatments 


Missing data in research studies is inevitable. The design is no longer balanced with 
a complete data set, and standard computing formulae no longer apply. Before the 
advent of modern computing, a complete data set was most advantageous because 
relatively simple formulae could be used for manual computations. Much effort 
was put into developing methods for the analysis of variance sum of squares 
partitions when there were unequal numbers of observations among the cells of the 
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factorial arrangement. General statistical routines programmed to accommodate 
known statistical theory have removed the computational burdens associated with 
the analysis of incomplete data sets. 


Orthogonality Is Lost with Missing Observations 


The sum of squares for one factorial effect will convey some information about 
other factorial effects when there are unequal numbers of observations for the treat- 
ment combinations and the sum of squares partitions are computed in the usual 
manner. This non-orthogonal relationship in the sum of squares partitions for the 
analysis of variance requires us to consider carefully the estimates we use for 
parameters in the model and the statistics we use to test the critical hypotheses in an 
analysis of the study. 

Orthogonal contrasts were introduced in Chapter 3 as contrasts that do not con- 
vey information about one another. Orthogonality carries the same meaning with 
observations in a factorial treatment design. When there are equal numbers of 
observations on each treatment combination, the sums of squares in the analysis of 
variance constitute an orthogonal partition of the treatment sum of squares. In Sec- 
tion 6.4 the additive partition of S'S Treatment for a balanced two-factor experi- 
ment was 


SS Treatment = SSA+SSB+ SS(AB) 


and the sum of squares for one factorial effect did not convey any information 
about other factorial effects. 

The example data shown in Display 6.5 illustrate the complications introduced 
by unequal treatment replication in a factorial treatment arrangement. The data in 
the cells represent the average speed in excess of the posted speed limit traveled by 
automobiles involved in 20 fatal accidents, 10 occurring in rainy weather and 10 
occurring in clear weather. The factors for the study are W (Weather) and R (Type 
of Roadway). Notice that 8 of the 10 accidents in rainy weather were on interstate 
highways, whereas 8 of the 10 accidents in clear weather were on two-lane 
highways. 

Observation of the marginal means for weather indicates the average speed in 
excess of the speed limit for rainy weather was slightly higher than that for clear - 
weather, 7, = 13 versus Y, = 12. However, the observed cell means indicate an 
entirely different result. The average speeds in fatal accidents in clear weather were 
greater by 5 miles per hour than in rainy weather for both interstate (20 — 15) and 
two-lane (10 — 5) highways. 

The unequal treatment replications lead to contradictory results from cell 
means and marginal means. An excess of accidents occurred on the interstates in 
rainy weather and an excess of accidents occurred on the two-lane highways in 
clear weather. Thus, the marginal mean for clear weather is biased downward by 
the excess of accidents on the two-lane highways with overall slower speeds, and 
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Display 6.5 Unequal Treatment Replication in a2 x 2 Factorial 
for Speeds in Excess of Posted Speed Limit 


Cell Means 
Interstate Two-Lane Sums 


| Be, 7 130 
T11 = 8 T2= 2 TY. = 10 


20 10 120 
160 
be a 10 


16 


the marginal mean for rainy weather is biased upward by the excess of accidents on 
the interstates with overall faster speeds. 

The sum of squares for treatments with unequal replication numbers is com- 
puted correctly as 


SS Treatihent= 8015+ 125¥ 4026 12:57 +200 128) + 8(10 — 12.5) 


= 325 


The sums of squares for main effects and interaction computed incorrectly with 
methods outlined in Section 6.4 are 


SSW = 10(13 — 12.5)? + 10(12 — 12.5)?=5 


SSR = 10(16 — 12.5) + 10(9 — 12.5)? = 245 
and 
SS(W R) = SS Treatment — SSW — SSR = 325 —5 — 245 = 75 


Inspection of the cell means and the sum of squares for interaction indicate an- 
other contradiction in the usual analysis methods. The calculated sum of squares, 
SS(W R) = 75, indicates some interaction is present in the study. However, in- 
spection of the cell means gives no indication of interaction. The observed simple 
effect of weather is equal to 5 for both highway types. The sum of squares for inter- 
action would be 0 if the sum of squares partitions were computed correctly. 

The general principles for a correct analysis of factorial treatment designs with 
unequal treatment replication are illustrated with the two-factor experiment on 
durability of asphaltic concrete. 
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Example 6.7 Asphaltic Concrete Durability Revisited Again. 


The experiment on tensile strength of asphaltic concrete specimens in Exam- 
ple 6.2 is used to illustrate the analysis of a factorial treatment design with 
unequal replication of the treatments. For illustration, suppose specimens 
were constructed with basalt or silicious aggregate types for the three knead- 
ing compaction methods. Suppose some of the specimens were damaged pri- 
or to testing, resulting in an unequal number of specimens being available 
among the treatments for tensile strength tests. The data with unequal replica- 
tion numbers are shown in Table 6.19. 


Table 6.19 Tensile strength (psi) of asphaltic concrete specimens for two 
aggregate types with each of three kneading compaction methods 


Compaction Method 
Kneading 
Aggregate Aggregate 
Type Regular Low Very Low Means (Y;_) 
Basalt 106 93 56 
108 101 
98 
Means (Y;, ) 107.0 913 56 937 
Silicious 107 63 40 
110 60 4] 
116 Aq 
Means (¥,, ) 111.0 61.5 41.7 72.6 
Compaction 
means (¥ ; ) 109.4 83.0 45.3 


Establish Estimators with the Cell Means Model 


The cell means model can be used to establish the appropriate estimators for popu- 
lation parameters and hypotheses that we are required to test. The cell means model 
is 


Yijk = Miz + Cizk (6.36) 


b= ie, 6 72.6 k= 12. ty 


where p;; is the cell mean for the zth level of factor A and the jth level of factor B, 
€ijk 1S the normally distributed random independent experimental error with mean 0 
and variance o?, and r;; 1s the number of replicate observations in cell (27). We will 
assume there is at least one observation in each cell of the factorial arrangement, so 
that r;; > 0 for all 2 and 7. 
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Least Squares Estimators for Cell and Marginal Means 


The cell means can be estimated by the least squares method following the proce- 
dures outlined in Chapter 2 for the cell means model. The estimators of the cell 
means are the observed cell means 


ig = — D  Yigk = Diy (6.37) 
and the estimator for experimental error variance is 
S | ~ 
GO = 3? = ——___ Yi (yige — Hig) (6.38) 
—ab 
where N = di 7j3.. 


The estimates of the cell means for the tensile strength of the asphalt concrete 
specimens are shown in Table 6.19, and the estimate of experimental error is 


| 89.83 
3? = (106 — 107.0)? +--+ 44 SUP ——— = 113 
14=6 8 
The unbiased least squares estimators of the marginal means are 
a oe a ieee 
eee dP and* pj; = A dP (6.39) 
j= a 

and sa = (6.40) 


The least squares estimates of the marginal means for the asphalt concrete speci- 
mens and their standard error estimates are shown in Table 6.20. 

For example, the least squares estimate of the marginal mean for the basalt 
aggregate type is 


] 
a 3 (107.0 + 97.3 + 56.0) = 86.8 


with standard error estimate 


= _— 
oes oe se 1 


The observed marginal means, y; and y,, shown in Table 6.19 do not have 
the same value as the least squares estimates of the marginal means in Table 6.20. 
The observed marginal means estimate weighted functions of the population means 
where the weights are proportional to the number of replications in the cells. The 
expected values of the observed marginal means are 
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Table 6.20 Least squares estimates of the marginal means for tensile strength of 
asphalt concrete specimens and their standard errors 


Standard 
Mean Error 
Aggregate Lh; oe 
Basalt 86.8 151 
Silicious yee Tee 
Compaction nn ‘ Sa 
Regular 109.0 1.53 
Low 79.4 33 
Very low 48.8 1.93 
+ oath 
BG; J = >. tans 
a 
and (6.41) 


a 
EG 5) = = 2 iG 

"i 71 
If the number of observations in the treatment cells of the study is proportional 
to the frequency with which those treatment combinations occur in the population, 
then the observed marginal means provide the appropriate estimators for the mar- 
ginal means in the population. The proportional relationship of observation num- 
bers to population frequencies is common in sample surveys. However, the 
proportional relationship would not be expected to hold for a designed experiment 
or comparative observational study, and the least squares estimates in Table 6.20 

should be used. 


Hypotheses Unchanged by Unequal Treatment Replication 


The hypotheses of interest in the factorial treatment design with unequal replication 
numbers are unchanged from those of interest with equal replication numbers. The 
initial research question in the factorial treatment design considers the existence of 
interaction between factors A and B. The interaction effect measures the differ- 
ences between the simple effects of A at different levels of B. The difference be- 
tween levels z and k of A at levels 7 and m of B is the general form of interaction: 


(Li; = Ukj) = (Lin — Hin) = big — Ukj — Lim + ULkm 


The null hypothesis of no interaction can be expressed in terms of the cell means as 
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Ao: bij — kj — Mim + Lem = 0 for all 2, 7,k, and m (6.42) 


In the absence of interaction, the effect of the individual factors on the 
response variable can be explored separately with tests of hypotheses about the 
marginal means. The null hypothesis of interest for factor A is the equality of the 
marginal means, or 


fg: She =P (6.43) 
and that for factor B is 
Ao: fy =B2="':=By (6.44) 


Weighted Squares of Means for Tests of Hypotheses 


Among the many methods put forth for analyzing factorial experiments with un- 
equal replication only the method of weighted squares of means proposed by Yates 
(1934) provides the sum of squares partitions to test all three hypotheses in Equa- 
tions (6.42) through (6.44). A description of other methods and the hypotheses that 
can be tested with them can be found in Speed, Hocking, and Hackney (1978). The 
tests for equality of marginal means are of interest only in the absence of 
interaction. 


Computing Sum of Squares for Interaction from Full and Reduced Models 


The sum of squares partition for interaction is determined from the principle of full 
and reduced models introduced in Chapter 2. The full model expressed in terms of 
the factorial effects is 


Visk Sofbaenda ly (eB ig + eizk (6.45) 


The solutions obtained from the least squares normal equations are used to compute 
the sum of squares of experimental error for the full model as 


a bij 
SSE, =~ S~S~ (yin —-B- Gi — B; - (2B)? (6.46) 
i=1 j=1 k=1 
Under the null hypothesis of no interaction the reduced model is 
Ys wich 0+ Cap (6.47) 
The solutions obtained from the least squares normal equations are used to compute 
the sum of squares of experimental error for the reduced model as 


Vij 


a b 
SSE, =>) >) >) lyin —B- 8: — BP (6.48) 
io) gl. bal 
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The sum of squares for interaction is computed as 
SS(AB) = SSE, — SSE; (6.49) 


The mean square for experimental error is MSE = (SSE¥)/(N — ab), and 
the mean square for interaction is S(AB) = SS(AB)/(a — 1)(b — 1). The usual 
Fo statistic, Fy = MS(AB)/MSE, tests the null hypothesis of no interaction in 
Equation (6.42). The calculations are illustrated in Appendix 6A. 


Weighted Squares of Means to Test Equality of Main Effects in the Absence 
of Interaction 


Tests of hypotheses can be conducted for the equality of marginal means of the fac- 
tors if the test for interaction is not significant and it can safely be assumed there is 
no interaction. The correct sum of squares partitions for the weighted squares of 
means method to test the null hypotheses in Equations (6.43) and (6.44) are shown 
in Display 6.6. The analysis is based on the sums of squares of the cell means 
designated as the observations xj; = 9; 


Display 6.6 Weighted Squares of Means Sum of Squares Partitions 


Factor A Factor B 
SS Ay =), We, — 2a) SSBy = ae Ui(E.j3 — Fy)? 


er y 
a 
er ty ? ij 


Ty = Qa; Wii, / Di Wi Tj =D; 0t/ 5% 


Ho: fy =o =" = Bh, Ao: fy =Bo =-°=By 


The sum of squares partitions required by the weighted squares of means can 
be computed by many statistical programs. However, the programs may have sev- 
eral options for the type of sum of squares partitions that are computed for the 
analysis. It is important that the correct options be used for the programs so that the 
correct sum of squares is computed by the program. ! 


1 Programs used for analysis of variance will provide the correct sum of squares for the weighted 
squares of means. Instructions on the use of most programs will indicate if and how different types of 
sum of squares partitions can be obtained. The correct sum of squares options for several well known 
programs are 

Program Sum of Squares 

SAS GLM Type Ill 

SPSS MANOVA UNIQUE 

MINITAB GLM _ Adjusted 

BMDP 2V Default 

Splus summary.aov(...,sS Type=3) 
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The analysis of variance for the asphaltic concrete specimens tensile strengths 
in Example 6.7 is shown in Table 6.21. 


Table 6.21 Analysis of variance for tensile strength of asphaltic concrete speci- 
mens with unequal replication of treatments with Yates’ weighted squares of means 


Source of Degrees of | Sumof Mean 

Variation Freedom Squares Square F Pr> fF 
Total 13 10,963.21 

Aggregate l 710.45 710.45 bon 7 000 
Compaction Z 6,806.45 3,403.23 303.07 .000 
Interaction Be 953.45 476.72 42.45 .000 
Error 8 89.83 Fis 


Interpretation of the Example 


The null hypothesis of no interaction between aggregate type and compaction 
method is 


Ho: pj — Pej — Pim + km = 90 for all2,7,k, and m (6.50) 


Interaction is significant since the statistic Fy = MS(AB)/MSE= 
476.72 /11.23 = 42.45, in Table 6.21, is significant with Pr > F’ = .000. 

With significant interaction between aggregate type and compaction method it 
will be necessary to look at the simple effects of one factor at each of the levels of 
the other factor to understand the nature of the interaction. Comparisons between 
the cell means for aggregate type at each level of compaction method are shown in 
Display 6.7. 


Display 6.7 Bonferroni ¢ Tests for Simple Effects of Aggregate 
Type for Each Compaction Method 


Compaction Standard 
Method Ly wee ies Error 


Regular 107.0 — 111.0 = -4.0 P23 


Low | 97.3 — 61:3 = 35.8 [1.23 


Very low 56.0 — 41.7 = 14.3 rie 
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The to statistics were computed for the contrast between aggregate types for 
each of the compaction methods in Display 6.7. The critical value for the 
Bonferroni ¢ statistic with three comparisons is | to | > t.o95,3.3 = 3.02. There is no 
significant difference between tensile strengths of the basalt and silicious rock 
specimens with the regular kneading compaction method. The tensile strengths of 
the basalt specimens were significantly greater than those for the silicious rock 
specimens for low and very low kneading compaction methods. 


Tests for Marginal Means 


The tests of hypotheses for equality of the marginal means for A and B ordinarily 
are not considered when interaction is significant and would not be considered for 
the current example. However, for the sake of illustration, the procedure is illus- 
trated for the case when no interaction exists and tests about the marginal means 
would be of interest. The two hypotheses to test for the asphaltic concrete example 
are (1) no differences among the marginal means for aggregate type 


Ho: Hy, = Ho, (6.51) 
and (2) no differences among the marginal means for compaction method 
fo: By =P: =f (6.52) 


The sum of squares for the weighted squares of means method are shown in 
Table 6.21. The statistic Fy = MSA/MSE = 710.45/11.23 = 63.26 tests the 
equality of marginal means for aggregate type. The statistic for the test of equality 
among the marginal means of compaction method is Fy = MSC/MSE = 
3403 .23/11.23 = 303.05. Both of the statistics are significant with Pr > F = .000 
in Table 6.21. 


Some Comments About Missing Data and Missing Cells 


A method was illustrated in this section to analyze the data from a study with un- 
equal subclass replication in a factorial treatment design. The method provides the 
correct estimators for population means and credible tests of hypotheses about the 
factors. Other methods of analysis for the unbalanced factorial treatment designs 
provide different tests of hypotheses about the factor effects. 

Searle, Speed, and Henderson (1981) discussed five methods for calculating 
sums of squares in the analysis of variance that were currently used in computer 
programs. All of the methods give the same results for balanced data, but they can 
yield different results for unbalanced data. Related articles by Hocking and Speed 
(1975), Speed and Hocking (1976), and Speed, Hocking, and Hackney (1978) pro- 
vide additional information on the computing methods and the hypotheses that are 
tested in the analyses by different computer programs. More extensive illustrations 
of some methods, including that used in this section along with some of the 
theoretical background, are found in Searle (1971, 1987) and Milliken and Johnson 
(1984). 
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All of the methods discussed in the context of the sum of squares partitions for 
main effects and interaction provide inadequate tests of hypotheses about factorial 
effects when entire cells of the factorial arrangement are missing. Under these 
circumstances an analysis based on the cell means model is recommended by 
Urquhart, Weeks, and Henderson (1973), Hocking and Speed (1975), Urquhart and 


Weeks (1978), and Searle (1987). 


EXERCISES FOR CHAPTER 6 


1. A chemical production process consists of a first reaction with an alcohol and a second reaction 
with a base. A 3 x 2 factorial experiment with three alcohols and two bases was conducted with 
four replicate reactions conducted in a completely randomized design. The collected data were 


Alcohol 


percent yield. 
Base ] 
] 2 ee he 
90.7 91.4 
) 87.3 89.4 
91:5 .88.3 


Source: P. R. Nelson (1988), Testing for interactions using analysis of means. 


Technometrics 30, 53-61. 


a. Write a linear model for this experiment, explain the terms, and compute the analysis of vari- 


ance for the data. 


Make a table of cell and marginal means, and show their respective standard errors. 

c. Test the null hypotheses of no base x alcohol interaction effects. What do you conclude from 
the test? What do you recommend as the next step in your analysis? 

d. Use multiple contrasts among cell means to help explain the interaction. For example, compare 


the two bases for each alcohol. 


e. Conduct residual analyses with a normal plot and then with a predicted plot; also conduct a 


Levene (Med) test. What do you conclude? 


2. A company tested two chemistry methods for the determination of serum glucose. Three pools of 
serum were used for the experiment. Each pool contained different levels of glucose through the 
addition of glucose to the base level of an existing serum pool. Three samples of serum from each 
pool were prepared independently for each level of glucose with each of the two chemistry meth- 
ods. The concentration of glucose (mg/dl) for all samples was measured on one run of a spectro- 


photometer. 


e. 


EXERCISES 219 


Method | Method 2 
Glucose Level ] 2 3 ] Bs 3 
42.5" “38.4 ~180-9 wo se TTS 
43.3 144.4 180.5 S0.> loi 1750 
42.9 142.7 183.0 4i2 130.3. 1749 


Source: Dr. J. Anderson, Beckman Instruments Inc. 


Write a linear model for this experiment, explain the terms, conduct an analysis of variance for 
the data, and compute the residuals. Is a transformation of the data necessary? Explain. 

If a transformation is necessary, compute the transformation for the data and the analysis of 
variance. 

Test the hypotheses of no method x glucose interaction effects. What do you conclude? 
Should you test for main effects? Why? 

Prepare a table of cell and marginal means with their respective standard errors. 

Test the difference between method means for each level of glucose, and interpret the results. 


A study of the effect of temperature on percent shrinkage in dyeing fabrics was made on two 
replications for each of four fabrics in a completely randomized design. The data are the percent 
shrinkage of two replicate fabric pieces dried at each of the four temperatures. 


Temperature 
Fabric’ 210° F © 215° 8 = 220° F 225'°F" 
] io, 21 “2c 80,20 “Tore 
2 22,24 42,40 34, 5.0 ~9.8,9°2 
3 2.8,3.2 4.4,4.8 8.7,8.4 13.2, 13.0 
~ o43,0 3.3; 0.5- 0, 3 107, 11,1 


Write a linear model for the experiment, explain the terms, and compute the analysis of 
variance for the data. . 
Test the null hypothesis of no fabric x temperature interaction. 

Partition the temperature main effect sum of squares into 1 degree of freedom partitions ae 
linear and quadratic regression sum of squares, and test the null hypotheses of no linear or qua- 
dratic response to temperature. 

Partition the temperature x fabric interaction sum of squares into temperature linear x fabric 
and temperature quadratic x fabric interaction sum of squares, and test the null hypotheses of 
no interaction for the respective partitions. 

Prepare a profile plot of the cell means versus temperature for each fabric, and interpret the re- 
sults. For example, the following questions may be asked: “How does drying temperature af- 
fect the fabric shrinkage?” “How does the relationship between shrinkage and temperature 
differ among the fabric types?” 


An experiment in soil microbiology was conducted to determine the effect of nitrogen fertility on 
nitrogen fixation by Rhizobium bacteria. The experiment was conducted with four crops: alfalfa, 
soybeans, guar, and mungbean. Two plants were inoculated with the Rhizobium and grown in a 
Leonard jar with one of three rates of nitrogen in the media: 0, 50, or 100 ppm N. Four replications, 
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Leonard jars, were used for each of the 12 treatment combinations. The treatments were arranged 
in a completely randomized design in a growth chamber. The acetylene reduction was measured 
for each treatment when the plants were at the flowering stage. Acetylene reduction reflects the 
amount of nitrogen that is fixed by the bacteria in the symbiotic relationship with the plant. 


Crop 
Nitrogen Alfalfa Soybean Guar  Mungbean 
0 20,01 tO, 2.0 0, et 0.8, 0.9 
OS 12 "55 43°02 028 bie Dake ee 
50 0.0, 0.0°° 0.6,0:6 §0.0, 0.1 0.7, 0.4 
60,00 -0.3, 0.3 0.0, 0.2 0.3, 0.8 
100 00,00 U0, 01 09.4.2 0.3, 0.1 
00,0)... 01,00 0.0.00 0.0, 0.1 


Source : Dr. I. Pepper, Department of Soil and Water Science, 
University of Arizona. 


a. Write a linear model for this experiment, explain the terms, and compute the analysis of 
variance. 

b. Perform a residual analysis and determine whether a transformation of the data is necessary. 
Transform the data if necessary, and compute the analysis of variance for the transformed data. 

c. Test the null hypotheses of no crop, nitrogen, or crop x nitrogen interaction effects. 

d. Partition the nitrogen main effect and the nitrogen x crop interaction sum of squares into | de- 
gree of freedom partitions for linear and quadratic regression. 

e. Test the null hypotheses of no nitrogen linear or quadratic main effects and the null hypotheses 
of no nitrogen linear or quadratic interaction with crops. 

f. Make a profile plot of the cell means versus level of nitrogen for each crop, and interpret the 
experiment. For example, you can ask the question, “How does the addition of nitrogen to the 
media affect the nitrogen fixation by the Rhizobium?” or “Is the effect of the addition of nitro- 
gen on nitrogen fixation the same for each crop?” 

g. Note that two treatment combinations, alfalfa with 50 and 100 ppm N, have all observations 
with a value of zero. This phenomenon is possible if the presence of a threshold level of nitro- 
gen in the growth medium completely inhibits Rhizobium activity. How does this affect the 
assumptions for the analysis of variance regarding homogeneity of variance? Do you have any 
suggestions to accommodate this situation in your analysis of the data? 


5. An agronomist conducted an experiment to determine the combined effects of an herbicide and an 
insecticide on the growth and development of cotton plants (delta pine smoothleaf). The insecticide 
and the herbicide were incorporated into the soil used in the containers to grow the cotton plants. 
Four containers each with five cotton plants were used for each treatment combination. The con- 
tainers were arranged in the greenhouse in a completely randomized design. Five levels (Ib/acre) 
were used for both the insecticide and the herbicide to give 25 treatment combinations. The data 
that follow are cell means for the dry weight of the roots (grams/plant) when the plants were three 
weeks old. 
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| Herbicide 
Insecticide 0 0.5 1.0 5 2.0 
0 122.018 G7 52,00"? 3625 29.25 
20 B2.fo Sa,7— FISO: (80.59 72.00 
40 S.7o Oe feu GP 82.50 
60 Cou fom --68./5 “17.25 68.25 
80 Si. CO BO 6.23 Tee 


Mean Square for Experimental Error = 174 with 75 degrees of freedom 


Source: Dr. K. Hamilton, Department of Plant Sciences, University of Arizona. 


a. Compute | degree of freedom regression sum of squares partitions for herbicide, insecticide, 
and interaction sums of squares. Compute no higher polynomial than cubic regression for 
herbicide or insecticide. 

b. Test the null hypotheses for each of the partitions, and determine the form of the polynomial 
regression that adequately describes the response. 

c. Transform the orthogonal polynomial equation into an equation in terms of herbicide and 
insecticide. Either use a standard regression program or the transformation equations in Chap- 
ter. 3. 

d. Interpret the results from plots of the cell means or the estimated polynomial equation. 


An experiment was conducted on the durability of coated fabric subjected to standard abrasive 
tests. The 2 x 2 x 3 factorial design included two different fillers (F,, F)) in three different pro- 
portions (25%, 50%, 75%) with or without surface treatment (S;, S2). Two replicate fabric speci- 
mens were tested for each of the 12 treatment combinations in a completely randomized design. 
The data are weight loss (mg) of the fabric specimens from the abrasion test. 


Surface and Filler Treatments 


S3 So 

Proportion of Filler Fy, F5 Fy, Fo 
25% 194 239 155 137 

208 187 173 160 

50% 233 224 198 129 

241 243 177 98 

75% 265 243 250 135 

269 226 229 lag 


Source: G. Box (1950), Problems in the analysis of growth and wear curves. 
Biometrics 6, 362-389. 


a. Write a linear model for the experiment, explain the terms, and compute the analysis of vari- 
ance for the data. 
. Prepare a table of cell and marginal means with their respective standard errors. 
c. Test the null hypothesis for all main and interaction effects. 
d. Compute the 1 degree of freedom regression sum of squares partitions for proportion of filler 
and interaction between proportion of filler and the other factors. 
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e. Test the null hypotheses for the regression partitions. 
f. Plot cell means versus proportion of filler for the four treatment combinations of surface and 
filler type, and interpret the results of your analysis. 


7. A soil scientist conducted an experiment to evaluate a four-electrode resistance network to compute 
electroconductivity (EC) of soil in specially constructed acrylic conductivity cells. The objective of 
the study was to evaluate the relationship between measured EC and soil water salinity at different 
water contents of soils. Three basic soil textures were included in the experiment since EC is spe- 
cific to soil texture. The cells were constructed of acrylic tubing, 4-cm long by 8.2-cm diameter, 
and packed with soil. Two cells were used for each treatment combination. The three soil types 
were loamy sand, loam, and clay. The salinity of the soil water, three levels, was based on the EC 
of the water at 2, 8, and 16 dS/m (decisiemens/meter). The water content of the soil was three lev- 
els, at 0%, 5%, and 15%. The resulting experiment was a3 x 3 x 3 factorial arrangement with two 
replications in a completely randomized design. The EC values of the soil determined on the basis 
of readings from the four-electrode network follow. 


Salinity 2 8 16 

Water 0 3, 13 0 5 id 0 =) is 

Loamy 0.60 1.69 3.47 0.05 0.11 0.06 0.07 0.08- 0-22 
sand 0.48 2.01 3.30 OT? 0s 09 0.06" Cre tT 


Loam 0.98 2.21 5.68 0.15 0.23 0.40 0.07 023-443 
O95 246. 3.11 ee ae O21 U.33- 
Clay je eS ee U2 Ufa. 2.10 G40 02 i> 


Loy ee Je U5 117 18 O05) Mime a! 


Seate te trt sa ae ce oto n oie ee SEM Se inne e eee, Oe Me Tee Fat | Lae eee ere eeer semen ete rne! FRE SS SERS x 
Source: H. Bohn and T. Tabbara, Department of Soil and Water Science, University of Arizona. 


a. Write a linear model for this experiment, explain the terms, and compute the analysis of vari- 

ance for the data. 

Prepare a table of cell and marginal means and their respective standard errors. 

Test the null hypotheses for all main effects and interactions. 

d. Compute the linear and quadratic orthogonal polynomial regression sum of squares partitions 
for salinity and water and their interactions, including the interactions with texture. Note that 
the levels of salinity and water are unequally spaced; therefore, the standard orthogonal poly- 
nomial coefficients given in Appendix Table XI do not apply. Some statistical computing pro- 
grams will automatically compute the orthogonal polynomial coefficients, given values for the 
levels of the factors (for example, MANOVA in SPSS and 2V in BMDP). The following are 
orthogonal polynomial coefficients that may be used to compute the orthogonal partitions: 


ile 


Water linear: —0.617 —0.154 0.772 
Water quadratic: 0.535 —0.802 0.267 
Salinity linear: —0.671 —0.067 0.738 


Salinity quadratic: 0.465 —0.814 0.349 
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Five nickel rods of 1-mm diameter were put in a metallic clamp in a suspension of aluminum 
oxide. A 100-volt tension was applied between the nickel rods and the vessel containing the sus- 
pension of aluminum oxide. The thickness of the aluminum oxide layer deposited on the nickel 
rods was recorded at three height positions of the five rods. The data are thickness of the deposit in 
microns. 


Clamp Position of Nickel Rod 
Height ] 2 3 4 5 


Bas 130 128 134 145 
p 126 150 Lig 124 118 
3 130 155 168 159 138 


Source: H. Hamaker (1955), Experimental design in industry. Bio- 
metrics 11, 257-286. 


Write a linear model for this experiment, explain the terms, and state the model assumptions. 
Do you believe the model assumptions are valid for this experiment? Explain. 

Suppose the assumptions are reasonably valid. Compute the analysis of variance for the data. 
Compute the 1 degree of freedom sum of squares for nonadditivity. 

Is the additive model for clamp position and height sufficient for this data? 


on0 op 


An entomologist conducted an experiment on the drinking rate energetics of honeybees to deter- 
mine the effects of ambient temperature and viscosity of the liquid on energy consumption by the 
honeybee. The temperature levels were 20° C, 30° C, and 40° C. The viscosity of the liquid was 
controlled by the sucrose concentrations, which were 20, 40, and 60 percent of total dissolved 
solids in the drinking liquid for the bees. The entomologist recorded the energy expended by the 
bees as joules/second. The data given below are for three replications for each of the nine treatment 
combinations in a completely randomized design. 


Sucrose % 


Temperature °C 20 40 60 
20 Jit, 3i1a8. | 2.0, O49, 1-8 a, 9.4, 9.3 
30 65,69, 1.3.2015, 722.9, 1354 17.5, 15.8, 147 
40 Td, Oo, Ie Aol, Fe ole © F901, 18.0, 19.9 


Source: Dr. S. Buckman, USDA Bee Research Lab, Tucson, Arizona. 


a. Compute | degree of freedom regression sum of squares partitions for temperature, sucrose %, 
and interaction sums of squares. 

b. Test the null hypotheses for each of the partitions, and determine the form of the polynomial 

_ regression that adequately describes the response. 

c. Transform the orthogonal polynomial equation into an equation in terms of herbicide and 
insecticide. Either use a standard regression program or the transformation equations in Chap- 
ter 3. 

d. Construct a profile plot such as that in Figure 6.5 with the estimated cell means and the esti- 
mated polynomial equation. 

e. Interpret the results. 
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10. 


11. 


Unequal Replication Numbers: A biologist incubated adrenal glands of rats in vitro under stimula- 
tion by ACTH and measured steroid production of the glands. The glands were taken from the ani- 
mals at four different stages of growth and were subjected to two different treatments. Glands from 
four animals were used for each treatment combination. However, several laboratory analysis were 
invalid, which resulted in unequal replication numbers for the treatments. The data given below are 
steroid production per 100 mg of gland per hour. 


Treatment 
Stage - / Shoe 
| 6.98, 6.58 8.62, 9.40, 9.20 
g: 6.07, 7.16, 6.34 9.42, 6.67, 8.64 
3 5.38, 7.31, 6.65, 7.44 4.96, 6.80, 7.61 
4 TAZ, 9.25, Fad TAT, 1.65, 6.052. © oo 


Source: Dr. R. Chaisson, Department of Veterinary Science, 
University of Arizona. 

a. Compute the analysis of variance in order to test the global hypothesis of no Stage by Treat- 
ment interaction. 

. Compute the least squares means and their standard errors for marginal and cell means. 

c. Estimate the contrast between the two treatments for levels 1, 2, 3, and 4 of the growth stages 
(ii, Sti Tee for 7 = 1,2,3, 4), and test the hypothesis of no difference at the .05 level of signi- 
ficance between the two means in each case. 

d. How do the least squares means differ from the observed means? 


Unequal Replication Numbers: Suppose the experiment on the chemical production process in 
Exercise 6.1 had unequal replications among the six treatment combinations of the two factors, 
Base and Alcohol. The data are given below. 


Alcohol 
Base ] Pa 3 
i °90.7, 91.4 89.3, 88.1 89.5, 87.6 
90.4 88.3, 90.3 
2 87.3, 88.3 94.7 93.1, 90.7 
91.5 91.5 


a. Compute the analysis of variance in order to test the global hypothesis of no Base by Alcohol 


interaction. 
Compute the least squares means and their standard errors for marginal and cell means. 


Estimate the contrast between the two bases for levels 1, 2, and 3 of Alcohol (ji, vee Bos j= 
1,2,3), and test the hypothesis of no difference at the .05 level of significance between the two 


means in each case. 
d. How do the least squares means differ from the observed means? 
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6A Appendix: Least Squares for Factorial Treatment Designs 


Equal Treatment Replication 


The sum of squares partitions for the data from a factorial treatment design can be 
derived from solutions to the least squares normal equations for a factorial effects 
model. The full model for a factorial treatment design with two factors will be used 
to illustrate the derivation. 

For the sake of simplicity in notation the full model is written as 


Yigk = + OG + Ay + Vig + ge Oa 


PTR? oe GSP hy RSV 2, ... ,r 


where yz is the general mean, a; is the effect of factor A, G; is the effect of factor B, 
i; is the interaction effect, and e;;, is the random independent experimental error. 
The interaction term (a/);; used in the main body of this chapter has been replaced 
by +i; to simplify the notation for the presentation in this appendix. 

The least squares estimates for the parameters in the full model are those that 
minimize the sum of squares for experimental error 


>) , ; Sd, - 3 3 3 Gi p= a; ~ 2B; - Vij)? (6A.2) 


i=] j=1 k=1 i=1 j=1 k= 


The normal equations from the minimization include one equation for yw and one 


equation for each of the factorial effects, a1,Q@2,... ,Qa; (1, (Go,... , 6; and 
11) Y12)-++ »Yab- The normal equations are obtained from the following set of 
derivatives: 
2 sx 6;- Yu) = 0 
Ou ig To oe ei es 
O : 
re ae a; — $;-Vj) =O 1=1,2,...,a . 
a (6A.3) 
ee ee 8; — ys)’ = 0 j=2,3,...,0 
6] : 
Bag ET je 28 — Hy HO t= 1,2,..:,4 
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Simplifying, the set of normal equations to solve is 


po abrii + brds G; + ard B; + run 4i; = Y.. 
j j tj 


Qj: brfi + bra; + ro Bi +7dF,; = vi. Oe eee 
j j 
a (6A.4) 
ee ari + Tai + arh; + THis = Wy. g§=1,2,...,6 
Viz: rh + rai +78; +7F,5 = Vij (21, 2,...,8 
fee ee 


Upon close inspection the a equations derived for the factor A effects sum to 
the first equation for ju; the b equations derived for the factor B effects sum to the 
first equation for yz: as do the ab equations for interaction; the y;; equations summed 
over the 7 subscript will give the a; equation; and the 7;; equations summed over 
the 2 subscript will give the @; equation. These linear dependencies require con- 
straints imposed on the estimates to provide a unique solution to the equations. Any 
constraints that lead to a solution will suffice. One set of constraints commonly 
used are the sum-to-zero constraints. The sum-to-zero constraints are Da; = 0, 
Dep 0, 0 G = 1,2,... 0) ev G = 1,2,... ,0). 

With the constraints, the equations are 


bs: abrit = y... 
Oj! Sia ye oe i. eS tt 
= 6A.5 
B;: arp part = yee 0 oe) 
Vij: ri t+ rai +rB, +795 = Vij PEN Backs: 5G 
j= 2,...,9 
The solutions are 
f= =y 
abr 
a= -fe9, —F (eee 
7 1 so } (6A.6) 
p,=—-pP=9;,-9 j= 1,2, “o 
ar 
ne Y ae Cee oF as 
Vij = — -f-6;-6,=3,; mee ea ee 1= 1,2, . 
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The estimate of the sum of squares for experimental error is obtained with a 
substitution of the estimates 7, Gi, By, and7,, into Equation (6A.2) as 


The difference between the total sum of squares and SS'E; is known as the re- 
duction in sum of squares due to fitting the model and is sometimes written as 
R (yu, a, 8, y). With equal replication for all treatment combinations the sum of 
squares for each factorial effect can be derived from the computation used to com- 
pute R(, a, G, vy). The reduction in sum of squares due to fitting the full model is 


a b a b 
R(y, a,B, y= BY... i Saige: ai S- Bij. + ye > Fits (6A.8) 
1 eet j=l 


For balanced data with equal replication numbers for each treatment combina- 
tion, the sum of squares partitions for the analysis of variance can be taken from the 
individual terms in Equation (6A.8) as 


sHity 3)? 
abr 


SSA = - AY. = ie Gi. — 9.) 
i=l = 


CF = fly... 


prot b (6A.9) 
ss = > Py: = ar) Ce v) 
j=) fe 
a b a b ; 
SS(AB)=)— YS 49,4. =T>_ Gi. — Fi Fy. + 
i=1 j=l i=1 j=l 


The sum of squares partitions shown in Equation (6A.9) are those shown in 
Section 6.4. They may be derived from considerations of full and reduced models. 
For example, the sum of squares for interaction SS(AB) can be found as the dif- 
ference between the experimental error sums of squares for the reduced model 
without the interaction terms and full model with interaction terms included. The 
models and sums of squares are 


Full Model: yi; = w+ a; + i; + 4; Pep with SSE, 
Reduced Model: Yijk = UO B; + Ci5k with SSE, 


The interaction sum of squares is found as SS(AB) = SSE, — SSE}, which is 
Dis; 7;;¥ij. [the same as the last term shown in Equation (6A.8)]. 

The equivalence of SSE, — SSE¥s to 44;45;77;,;yij, can be shown by solving the 
normal equations for the reduced model and computing the reduction in sum of 
squares due to fitting the reduced model as R(,a, 8). The normal equations for 
the reduced model are obtained from those for the full model in Equation (6A.5) by 
removing the equations for +;; and the 7,; terms in the remaining equations. The 
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solutions for 7, @;, and ip will be those shown in Equation (6A.6). The reduction 
in sum of squares due to fitting the reduced model will be 


a b 
R(u, a, B) = fy... + Says. + "Biv (6A.10) 
i=l j=] 


The difference between R(, a, 9, y) and R(p, a, 9) is seen to be Mi da5 Vig Vis., 
and, therefore, the differences between the sums of squares for experimental error 
of the two models will be equivalent to the same quantity. That is, 


SS(AB) = SSE, — SSE; = R(y,a, 6,7) — R(u,a, 8)  — (@A.11) 


The sums of squares SSA and SSB can be derived in a similar fashion for 
balanced data, or SSA = R(y,a) — R(w) and SSB = R(p, 8) — Rp). 


Unequal Treatment Replication 


The derivation of the interaction sum of squares with unequal treatment replication 
in the factorial treatment design is demonstrated with the simple 2 x 2 factorial 
example shown in Table 6A.1. 


Table 6A.1 Example data for a 2 x 2 factorial with unequal treatment replication 


Full Model 


The full model normal equations for the example data in Table 6A.1 are de- 
rived by the methods shown at the beginning of this appendix. The coefficients for 
the parameters in the equations will reflect the unequal replication numbers. In gen- 
eral, the equations will be 


Go WN 


N 


N es) 
=> ®» » ® B® ® 
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a 


b b 
yw CO NGi+ Tr, 03 + ee i es a TijVij = V... 


b b 
a: m+ rGit+ > ryBy t+) ry = Vi. 
j=1 IA 
a a 
By rf + S yO + 738; + S Tif Vig = Y.3. 
i=1 wat 


Visi Tight + igQi + 738; + Tigi = Vi. 


The normal equations for the data in Table 6A.1 are 


HSGypasnh8Gp +) SPyocho 8G; 08 B= He Pyotr 27, + Fy 
+ 5a) + 1, ee ty tH, = 
ee 26, B, - ey Tag = 
+ 265-2 205 + 5B, + 3, + 2704 = 

+ 24, + @ + 3B, + 2719 + Vo 
+ 301 = 4b; = aly = 
+ 2a + 2B, + 2719 Ss 
+ 2 + 28, + 27% = 

a2 +e + oo 


Since there are linear dependencies in the equations the “sum-to-zero 
constraints” can be imposed on the equations to obtain a solution. The constraints 
are 


Q@, + @2 = 0, Bp, + By =9, Fatt Fa + Vio = 9, Vu + Yo, = 0 
Yor + Yo, = 90, and Fo + Yoo = 0 


The solutions to the equations after the constraints are applied are 


ee ee 19 
Ste a ee 8 8g 
ee a Se 


The sum of squares for experimental error can be determined with 


SSE; = DRE Vix — R(w, a, 8,7) 


where 
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R(y, a, By) = By... + OY... + 2 Bis. = ONY GVas 


The calculation for R(z, a, @,y) is 


R(u, a, B,Y) = LY... “3 Q1Yy,. = Q2Y2.. -— Bry. + Boy 2, 
+ F411. + Vioy12. + Yor Y21. + Yoo422. 
1 
= 54 19162) + 1(20) + (— 1)(12) + 19(23) + --- +(— 19) + 18)] 


== 132,833 


Given Xy;., = 140, the sum of squares for experimental error from the full model 
is 
SSE = 140 — 132.833 = 7.167 


Reduced Model 


The normal equations for the reduced model without the +,;; interaction terms 
are obtained by eliminating the equations for the +,;; and eliminating the 7,; terms 
from the remaining equations shown for the full model. The equations for the re- 
duced model are 


Si + Sa, + 3, + 56, + -38, = 32 


Si + 5a) + BOy He eRames . 20 
37 toe + 26, + 6 = 2 
St +; Say =e Bese & 58) = 33 
Fi. Vmday =F Op + 35, = 9 


The sum-to-zero constraints are @| + @_ = 0 and Bi + 6B, = 0. The solutions 
are 


Se = ae = eo ae | 
p= Or See Pe eg 


56 56 36" 


45 
56 


The reduction in sum of squares due to fitting the reduced model is 


R(y, a, B) = fy.. + Gyr. + Goy2.. + Biya. + Boys. 
] 


= 5, (2122) + 3(20) + ( — 3)(12) + 45(23) + (— 45)(9)| 


= 132.821 
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The sum of squares for experimental error for the reduced model is 


SSE, =D BS Vink — R(p, a, 8) = 140 — 132.821 = 7.179 
a) 


; 07 


The sum of squares for interaction is 


SS(AB) = R(u, a, 8,7) — Ru, a, B) = 132.833 — 132.821 = 0.012 


SS(AB) = SSE, — SSE; = 7.179 — 7.167 = 0.012 


Main Effect Sums of Squares 


Tests of equality for marginal means for A and B, Ho: ff, =f,and Ho: 
}i, = f£9, in the absence of interaction require the sums of squares partitions from 
the method of weighted squares of means (Yates, 1934). Some computer programs 
that compute the required sums of squares were indicated in Section 6.9. The hy- 
pothesis tested by the sum of squares partition for a main effect depends greatly on 
the computational technique used in the least squares estimation process. Details of 
the results from different techniques can be found in Hocking and Speed (1975), 
Speed and Hocking (1976), Speed, Hocking, and Hackney (1978), and Searle, 
Speed, and Henderson (1981). 


/ Factorial Treatment Designs: 


Random and Mixed Models 


The subject of variance component analyses is expanded to more complex designs 
in this chapter. Variance component models are developed for several variations on 
the factorial treatment design. Random effects models and models with mixtures of 
fixed and random effects for factorial arrangements are introduced in this chapter. 
The concept of the factorial treatment design is extended to include experiments 
with factors nested within other factors. Designs are discussed for experiments that 
have a combination of crossed and nested factors. Information for determining repli- 
cation numbers is included in the discussions. Rules are given for deriving expected 
mean squares for a variety of balanced factorial experiments. 


7.1. Random Effects for Factorial Treatment Designs 


232 


Random effects were introduced in Chapter 5 for studies in which the effects of the 
treatment factor were random samples from a population of treatment effects. The 
objective was to decompose the total variance into identifiable components. The 
variability caused by one source or factor can depend on the conditions under 
which it is evaluated. Thus, some of the total variance is associated with the interac- 
tion between two or more factors. The following example illustrates the interaction 
variance between two factors. 


Example 7.1 Evaluating Machine Performance with Variance 
Components 


A manufacturer was developing a new spectrophotometer for use in medical 
clinical laboratories. The development process was at the pilot stage of 
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assembly after which machine performance was to be evaluated: from assem- 
bly line production. 


Research Question: A critical component of instrument performance is the 
consistency of measurements from day to day among machines. In this partic- 
ular instance, the scientist who developed the instrument wanted to know if 
the variability of measurements among machines operated over several days 
was within acceptable standards for clinical applications. 


Treatment Design: The scientist set up a factorial treatment design with 
soa eS _and “days” as factors. F ee eee a esr og ae 


sinad x 


ron rents. Two serum iisampleste were andor sacra to 
piel of the four machines on each of the four days for a completely ran- 
domized design with two replications of each treatment combination. The 
same technician prepared the serum samples and operated the machines 
throughout the experiment. The observations on triglyceride levels (mg/dl) in 
the serum samples are shown in Table 7.1. | 


Table 7.1 Triglyceride levels (mg/dl) in serum samples run on four machines on 
each of four days 


Machine 
Day | 2 3 4 
1 142.3, 144.0 148.6, 146.9 142.9, 147.4 133.8, 1332 
2 134.9, 146.3 145.2, 146.3 125.9, 127.6 108.9, 107.5 
3 148.6, 156.5 148.6, 153.1 135.5, 138.9 132.8 1497 
4 152.0; 151.4 149.7, 152.0 142.9, 142.3 141.7, AZ 


Source: Dr. J. Anderson, Beckman Instruments, Inc. 


rs because they repre random sample 


frome een tlitio ion ah nee eens the machines could be run. The 
factorial eames! enabled evaluation of interaction between Machines and 
Days. The consistency of machine performance would be evidenced by the absence 
of interaction. | 


Statistical Model for Variances with Two Treatment Factors 


Variability due to the interaction of random factors can play an important role in 
the inferential process. A random effects model for the two-factor experiment in a 
completely randomized rie reas is 
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Yigk = b+ aj + by + (ab)ij + esjx (7.1) 
te dies Lonel Me Metasnnds Rackets ae 


The random effects a;, b;, and (ab);; are assumed to be independent and normally 
distributed with means of 0 and variances o7, o7, and o7,, respectively. The effects 
are assumed to be independent of one another. The random errors e;;;, are assumed 
to be independent and normally distributed with mean 0 and variance o”. 

The observations y;,;, in the random effects model have a normal distribution 
with mean y and variance 


sie 2 2 2° 
oy > o~ + On + 0; ey (7.2) 


The components of variance in Equation (7.2) become the focus of any investiga- 
tion with random effects. 

The factorial analysis of variance for the measured concentration of tri- 
glycerides in the serum for the spectrophotometer tests described in Example 7.1 is 
shown in Table 7.2. The analysis of variance computations are those presented in 
Chapter 6. The expected mean squares are included in the analysis of variance 
table. A complete set of rules for expected mean square determination applicable to 
several types of factorial models, including random effects models, is given in Sec- 
tion 7.6. 


Table 7.2 Analysis of variance for spectrophotometric readings from four 
machines on each of four days 


Source of Degrees of Mean Expected 
Variation Freedom Square } Mean Square 
Day | 3 MSD = 445 o*+ro7, +rbo% 
Machine 3 MSM = 549 a? + ro4,, + rao?, 
Interaction 9 MS(DM) = 87 O*7 +75, 

Error 16 MSE =18 e 


Point Estimators for the Variance Components 


With equal numbers of observations for each treatment combination the analysis of 
variance method discussed in Section 5.3 can be used to estimate the components 
of variance. The analysis of variance is computed for the factorial experiment just 
as it is for the fixed effects model. The estimates of the components are determined 
by equating the observed mean squares to the corresponding expected mean 
squares and solving for the unknown component values. The estimates for the four 
components of variance in Table 7.2 are 
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Error: G = MSE = 18.0 
5 MS(DM)-MSE 87-18 
Interaction: Cm = Fo. = 45 
is 2 
a5 MSM—-MS(DM) 549-87 
Machines: OS UF Pr swSH78 
ra 2(4) 
9 ° MSD MS(DM)° 445-— 87 
Days: o> = = —_ = 14 8 


rb 2(4) 
The estimate of total variation for a single observation is 
G, =O +64 407, + Gam = 18.0 + 44.8 + 57.8 + 34.5 = 155.1 


and the estimated standard deviation is@, = 12.5. 


Tests of Hypotheses About the Variance Components 


The significance of the contribution by the components for Machines, Days, and 
their interaction can be assessed with the F' test. The denominator for the Fo statis- 
tic is the mean square with the same expectation as the numerator mean square 
under the null hypothesis. The test for no interaction, Ho: 07, = 0, requires that the 
mean square for error be used for the denominator since it will have the same ex- 
pectation as 1/S(DDM) under the null hypothesis. The statistic is 


Sse AE ———_ 4.83 
“Moe §©=—s«d'B 


and the null hypothesis of no interaction is rejected with Fo > F'o5,9.16 = 2.54. 

The situation is different for tests involving components of variance for main 
effects. The expected mean squares for main effects are equal to the expected mean 
square for interaction when the null hypothesis is true. Therefore, the correct Fo 
statistic to test Hp: 07 = 0 is 


MSD 445 


*0= MfS(DM) ~ 87 


ap | 
and the null hypothesis for the Days component is rejected with Fo > F'o5,3,9 = 
3.86. Likewise, the statistic to test Hp: 07, = 0 is 


MSM 549 


=  - — = 31 
ViGiD AA ireRis a 


Fo 


and the null hypothesis for the Machines component is rejected with Fo > 
F'95,3,9 = 3.86. 
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An interval estimate for 0? can be calculated as shown in Section 5.4 since 
SSE/o? is a chi-square variable with ab(r — 1) degrees of freedom. 


Interpretations of the Variance Component Estimates 


Each of the components contributes significantly to the variation of a measure- 
ment from this particular model of spectrophotometer. The Error component, 


oS? 


= 18.0, represents ee variation in preparation of the serum samples. The 
Machines component, o. = 57.8, is the variability in machine performance and 


contributes 37% of the variation. The Days component, G4 = 44.8, is the varia- 
bility associated with a new start-up utilizing new reagents for the analysis of sam- 
ples and other sources of variability that can be = with day-to-day 


operational differences. cei ee oae Sam = = 34.5, contributes 22% 
= lode total variation. The si nifica nt interé 1 implies that the relative perform 
ance of the several machines vary SHERRIE? WEE the OLY to day 
ma ;in the operation. An — in the calibration of the machines from 
bees ne a could be one possible explanation of the interaction. 

The factorial design has made it possible to identify several sources of vari- 
ability in the measurements made by this model of spectrophotometer. The inves- 
tigator, based on experience, will be able to decide if any of the contributing 
sources of variability exceeds an acceptable level and correct any deficiencies in 
the machine or operating conditions if necessary. 


ractic mn 


Variance Components for Three-Factor Studies 


The expected mean squares for a three-factor experiment with random factors are 
shown in Table 7.3. Some complications arise in the construction of Fo statistics for 
tests of hypotheses about the components of variance in random models with more ~ 
_ than two factors. The mean square for error can be used to test the hypothesis of no 
three-factor interaction, and the mean square for three-factor interaction, 
MS(ABC), can be used to test hypotheses for two-factor interaction components. 
However, upon inspection of the expected mean squares in Table 7.3 it can be seen 
that there is no legitimate mean square for the denominator of the Fo statistic to test — 
null hypotheses about the main effect variance components. 


Approximate F Tests Required for Some Hypotheses 


It is necessary to construct a mean square for the denominator of the Fo statis- 
tic to test the significance of main effect variance components. The construction of 
approximate fo statistics was discussed in Chapter 5 with unequal subsample num- 
bers. For example, to test the hypothesis Ho: oR = 0, there are two possible Fo sta- 
tistics that approximate the F' distribution. 

The first approximation is constructed with MSB as the numerator and 
M= MS(AB)+ MS(BC) — MS(ABC) as the denominator. The second ap- 
proximation uses MI1=MSB+MS(ABC) as the numerator § and 
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Table 7.3. Expected mean squares for a three-factor experiment with random 


effects 
Source of Degrees of Expected 
Variation Freedom Mean Square 
A a,— | a7 + ab se had + rba2, + rbco? 
B btend a7 + abc + rco*, + ee: + raco; 
C | o*+ro*,.+rbo2, + raoz, + rabo? 
AB (a — 1)(b— 1) o7 + ab + rco?, 
AC (a — 1)(c — 1) o? + "Fab + rbo?, 
BC (b—1)(c — 1) o7 + ab + rao}, 
ABC (a — 1)(6 — 1)(e — 1) o*? + ro?,. 
Error abc(r — 1) o? 


M2 = MS(AB)+ MS(BC) as the denominator. The first ratio may be easier to 
use because approximate degrees of freedom need be computed for only one of the 
mean squares by the Satterthwaite procedure (Equation 5.27). However, it is 
possible to construct a negative mean square when some of the mean squares have 
negative signs in the function. Gaylor and Hopper (1969) discuss some of the 
problems associated with approximating the Ff’ distribution with linear 
combinations of mean squares. The recommended Fo statistic would be the second 
with synthesized mean squares M1 and M2. 


7.2 Mixed Models 


Many experiments are ¢ designed to study the effects of one factor on the nepalatee 
mean and the effects of another on the population variance. These experiments 
gence ca ca = . and. random factors. Mi Node! piebiorial arrange- 


cause e they contain a mixture of Padi and fixed effects. 
The model and analysis for mixed (aol consist oF two parts because there are 


variat ition in a Be pulation of effects whereas Lhe inferences for the fixed effects fac- 
tor are restricted to the specific levels used for the experiment. The experiment de- 
scribed in the following example included a mixture of random and fixed factors. 


Example 7.2. Evaluation of Two Chemistry Methods on Four Days 


New methods of chemistry frequently are developed to assay for compounds 
in a clinical laboratory setting. Given the choice among two or more chem- 
istry methods, the clinical chemist must evaluate the relative performance of 
the methods. 
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Research Question: In this example, a chemist wanted to know whether the 
two chemistry methods consistently provided equivalent results in an assay 
for triglycerides in human serum. A methods comparison test was constructed 
by the clinical chemist to evaluate the difference in the performance of the 
two chemistry methods for the assay. 


Treatment Design: A factorial treatment design was used with the factors 


“chemistry methods” and “days.” The two methods each were to be tested on 
four days, for 1 eageatseneed 


Experiment Design: Each day two replicate samples of serum were prepared 
with each of the two chemistry methods. The samples were analyzed in 
random order on the same spectrophotometer for a completely randomized 
experiment design with two replications. The same technician prepared the 
serum samples and operated the spectrophotometer for each of the tests. The 
observations on triglyceride levels (mg/dl) in the serum —- are shown in 
Table 7.4. 


Table 7.4 Triglyceride levels (mg/dl) in serum samples from a factorial 
arrangement for two chemistry methods, fixed factor, and four days, random factor 


Day 
Method 2 3 4 
1 142.3, 144.0 134.9, 146.3 148:6/996.5 152.0, 151.4 
2 1429, 1474 2235.9,127.6 . 1503, tee a9, 142.5 


Source: Dr. J. Anderson, Beckman Instruments, Inc. 


| The inferences are restricted to a comparison between the 


two methods used in the experiment. * 


The advantage of the factorial arrangement in this example is the ability to 
evaluate : the consistency of the two methods under repeated ins. In the absence of 
interaction the two methods would produce the same relative results from day to 
day. The presence of interaction would indicate one or both of the methods were 


inconsistent in repeated applications. 


Statistical iWiodel for One Fixed and One Random Factor 


The linear model for a two-factor experiment with a fixed effect, factor A, and a 
random effect, factor B, is 


Yigk = + a4 +b; + (ab)ij + eijK (7.3) 


pe l2.38 oS a eee 
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where yp is the mean, a; is the fixed effect for factor A, b; is the random effect for 
factor B, (ab);; is the interaction effect, and €;;; 1s random experimental error. The 
random effects, b; and e;;,, are assumed independent and normally distributed with 

means 0 and variances o and o”, respectively. 
The interaction effects (ab);; are assumed to PS. rendon subkcled independent 
The ypiterae action effects are 


Analysis of Mixed-Factor Experiments 


The expected mean squares for the mixed-model analysis of variance are different 
io tied oe Ear bea Se ke fixed or Soar ean! random effects models. 

he expected mean squares and the analysis for the factorial experiment with one 
ando manne ot one ined Saar are STHiseated I for he chemistry method experiment in 
Example 7.2. The analysis of variance for the 16 observations is shown in: Table 
7.5. The expected mean squares are included in the analysis of variance table. Note 
the use of 6?, for the variance of the fixed effect as defined in Table 6.5. 


Table 7.5 Analysis of variance for a factorial experiment with one fixed effects 
factor, Method, and one random effects factor, Day 


eee Drees) —<“<—~*SMWMcag = Expected 
Variation Freedom Square Mean Square 
Method MSM = 329 a + ro, + rhe, 
Day 3 MSD = 144 o* +ro2,, + rac; 
Interaction 3 MS(MD) = 62 o*+ro?, 
Error 8 MSE = 14 a 


Source: Dr. J. Anderson, Beckman Instruments, Inc. 


Tests of Hypotheses About Variance Components and Means 
The null hypothesis of no interaction, Hp: o7,, = 0, is tested with 


it\| MS(MD)  @ _.,, 
ol MSE ee 


and the null hypothesis is rejected since Fo > F'o5.3.3 = 4.07. 


omparisons can be made between chemistry 
gees within each day with contrasts among cell means. 

test of ‘the null hypothesis of no difference between the marginal means of 
the fi fixed effect factor would only be appropriate in the absence of interaction. For 
illustrative purposes only, the Fo statistic for Chemistry Methods is 
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M 329 
Dial ee Ot 


SU MSC. 


and the null hypothesis would not be rejected since Fo < F'951.3 = 10.13. The 
observed means for the two chemistry methods were y7, = 147 and y, = 138 © 
mg/dl. 
The Fo statistic to test the significance of the variance component for Days, 03, 
iS 
MSD 144 


~ MS(MD) 62 ~~ ek 


Fo 
and the null hypothesis Ho: of = 0 would not be rejected since Fo < F'o53.3 = 
9.28. 


Standard Errors for the Fixed-Factor Means 


The standard error for the difference between the two chemistry methods on a 


given day, 
/}2(02 + ro?) 
O95. —Trj.) 2 r (7.4) 


ee eM DD) 2(62) 


ee -! ae eres |e 
a oa Ba 
pe pe 


Say 3 Se Te 


As a general rule the mean square used for the standard error of a difference 
between the. means of a fixed effect factor is the denominator mean square of Fo 
used to test the null hypothesis about the fixed effect. Thus, the standard error of 
the difference between the marginal means of the two chemistry methods is esti- 


mated by 
2MS(MD) 2(62) 
a ee 24) = 3.9 (7.6) 


Three-Factor Experiments with Random and Fixed Factors 


is estimated by 


Tis 
Tae 


es 


The problem of constructing F statistics for tests of hypotheses was discussed in 
Section 7.1 for the three-factor experiment with three random factors. Similar diffi- 
culties occur with the mixed effects model for experiments with two or more ran- 
dom factors. The expected mean squares are given in Table 7.6 for three-factor 
experiments that have either one or two random effect factors with the remaining 
factor(s) fixed. 
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Table 7.6 Expected mean squares for three-factor experiments with (1) one fixed 
effect factor and two random effect factors and (2) two fixed effect factors and one 


random effect factor 


Expected Mean Square 
Source of . 
Variation A Fixed, B and C Random A and B Fixed, C Random 
A a7 + "abe “F reap + rbo?.+rbc0? 0? +170? abe + rba?, + rbcé? 
B a7 + "ab - ria 7 ele Ks Taco? a7 + Oa = rad; ag reel 
SS o7 + "Pa — poe + raoz,+rabo2 o*% +ro?,,+rbo2, + raoz, + rabo? 
AB o? + "ab + rco?, a? + "ab + rc62, 
AC o? + ro?,, + rbo2, o* + "abe + + rbo2, 
BC o* +ro*,, + raz. o* + Oa + rag}, 
ABC o*? + ro*,. o* + ro, 
Error a o* 


The difficulties that are encountered in the construction of Fo statistics for 
some hypotheses are immediately apparent upon inspection of the expected mean 
squares. In some instances, there is no legitimate mean square available for the 
denominator of the Fo statistic among the existing mean squares. The mean squares 
for the Fo statistic must be synthesized as discussed in Sections 5.11 and 7.1. 


Alternative Mixed Model with Restrictions on the Interaction 


There are several versions of the mixed model based on the definition used for the 
interaction effects; ace Bocas. (1973, 1985) and Searle et al. (1992) for a techni- 


cal discussion. The alternative model places a constraint on the interaction effects. 
The model is 


Yigk = Wr OG + G5 + (ag)ig + ijk (7.7) 

Ss eee os eee es ee 
where yz is the mean, a; is the fixed effect for factor A, g; is the random effect for 
factor B, (ag);; is the interaction effect, and e;,;, is random experimental error. The 


random effects, g; and e;;,, are assumed independent and normally distributed with 
means 0 and variances o? and o”, respectively. The interaction effects (ab);; are 


g 
assumed to be random effects and normally distributed with mean 0 and variance 
(a-}) 2 


a Cag" 


ctors, | » ais is is feed ane alternative model has the interaction 


= (a9); eee =O" 
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With this model, interaction effects summed over levels of the random factor 


Yagi = (ag);, will not be equal to zero, because they represent only a random 
sample of the interaction effects at each level of the fixed factor. However, at any 
given level of the random factor there is a finite set of interaction effects equal to 
the number of levels for the fixed factor, and the summation, }** (ag);; = (ag).;, is 
equal to zero. Consequently, there is a covariance between two interaction effects at 
the sae level of the random effect and different levels of the fixed effect, which is 
_ 1 a, . For example, with the experiment in Example 7.2 observations made on 
the : ae day with the two chemistry methods will be correlated, but observations 
on the same chemistry method on two different days will not be correlated. 

Thus, the primary distinction for the alternative model is the presence of corre- 
lation between the interaction effects. As a consequence, the expectations of some 
mean squares are different with the alternative model. For example, with two fac- 
tors, A fixed and B random, the expected mean squares are 


E(MSA) = 0? +10? — rb@? 


= - 
1 lasB iG Gro ty (a0) E(MSB) = 0? + rav?+O (7.8) 


E[MS(AB)] = 0? + r0%, 


E(MSE) = 0° 


ference in t -expected m - r ire for 
om Ce ye consi derable 2 limplaee on FSSA oe For 
example if the restricted moder’ wee ised for Example 7.2 a test of the hypothesis 
Ho: 02 = 0 would require the test statistic Fy = MSD/MSE = 144/14 = 10.29 
and the null hypothesis would be rejected with Fo > F'o5,3.3 = 4.07, which is just 
opposite of the conclusion with the original test for the model without restrictions 
on the interaction. , 
Hocking (1973) discusses the relationship between the two models and shows 
the relationship between the variance components for the two models to be 


l 
co? =o, + ~ ab (7.9) 


we b; ai Cijk are sinpaneiied random ee with variances op and o7, respec- 
tively. There was no assumption that the sum of (ab);; over levels of the fixed 
factor }~; (ab);; = (ab),; is equal to zero. 
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Therefore, a reasonable choice must be made with regard to the model that is 
The model without restrictions on 
ne major einen ine that th - expected mean squares 
for a } ata are consistent with the unrestricted model (Hartley & Searle, 
1969). The restricted model is not considered in the unbalanced case. If there is a 
possibility of correlation between effects of a fixed factor for a given level of the 
random effect, and data are balanced, then the restricted interaction model may be 
appropriate. If not, or if data are unbalanced, then the model discussed originally in 
this section with no correlation among the interaction effects is most appropriate. 


7.3 Nested Factor Designs: A Variation on the Theme 


The standard factorial treatment design has two prominent characteristics: Each 
level of every factor occurs with all levels of the other factors, and the interaction 
among factors can be examined. 

In certain types of studies the levels of one factor, B, will not be identical 
across all levels of another factor, A. Each level of factor A will contain different 
levels of factor B. 

The levels of B are said to be nested within the levels of A. The designs are 
referred to here as nested factor designs. They are also called hierarchical 
designs. The following example illustrates a design to study components of vari- 
ance for nested factors. 


Example 7.3 Glucose Standards in Clinical Chemistry 


Clinical laboratories perform patient serum assays critical for correct medical 
diagnoses. The laboratories maintain quality control programs to monitor the 
performance of assays and ensure the physician is receiving accurate infor- 
mation for diagnosis. 

The important sources of variation in the assays are days on which the as- 
says are conducted, the replicate runs within days, and the replicate serum 
sample preparations within runs. The quality control program requires that a 
spectrophotometer be tested with several runs on each of several days with 
serum standards used in the laboratory for control runs. Replicate serum 
preparations are evaluated within each of the runs. 

The data in Table 7.7 are observations from a design used for quality con- 
trol on the analysis of glucose standards. Glucose standard serums are main- 
tained in the laboratory specifically for quality control runs. There were c = 3 
replications of the standard prepared for each of b = 2 runs on each of a = 3 
days. 

The design is a nested design with two independent and unique runs on 
each of three days. The nesting of runs within days occurs because a run on 
any one day has no relationship with a run on any other day. For example, the 
first run on day | has nothing in common with the first runs on days 2 or 3. 
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The runs are numbered 1 through 6 in Table 7.7 to reflect their independence 
from one another from day to day. 

In the same manner the replicate serum preparations are nested within 
runs. The Days, Runs, and Replicates represent factors at the first, second, 
and third levels of the hierarchy. 


Table 7.7 Glucose (mg/dl) in quality control standards 


Day | Day 2 Day 3 
Run1I Run2 Run3 Run4 Run 5 Run6 
42.5 42.2 48.0 42.0 41.7 40.6 
43.3 41.4 446 42.8 43.4 41.8 
42.9: ..:418 AF i ALS 42.5 41.8 
Day mean (9; _) 42.4 44.0 42.0 


Source: Dr. J. Anderson, Beckman Instruments, Inc. 


Other Examples with Nested Factors 


Consider a genetics study involving animals wherein each sire (male parent) is 
mated to a random sample of dams (female parents), and each mating results in a 
litter of several offspring as shown in Display 7.1. The Sires, Dams, and Offspring 
represent factors of interest in the study. There are different dams for each sire, and 
the dams are nested within sires. The offspring or progeny from a dam are different 
from those of other dams, and the progeny are nested within dams. The factors in 
the nested design form a hierarchy. The hierarchy or nesting of the factors is illus- 
trated in Display 7.1. The uppermost level of the hierarchy represents sires, fol- 
lowed by dams and progeny in the second and third levels, respectively. 


Display 7.1 Nested Factor Design in a Genetics Study 


There are b = 3 different dams mated to each of the a = 2 sires for a total of 
ab = 6 dams, and there are c = 2 progeny per dam for a total of abc = 12 progeny. 

Nested designs occur in education research that utilizes several elementary 
schools. The classrooms are nested within the schools, and the students are nested 
within the classrooms. 
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An experiment on fabric dye formulations requires that several replicate 
batches of each formulation be independently mixed and each batch tested on 
several specimens of a common fabric. The replicate batches are nested within dye 
formulations, and fabric specimens are nested within batches. 


Statistical Model for Nested Factors 


The factors in the nested design hierarchy can be fixed or random factors. The de- 
sign for glucose standards in Example 7.3 would have all random factors if days, 
runs, and replicate serum preparations were considered random samples of their 
respective populations. | 

The linear model for a nested design with three random nested factors—A, B 
within A, and C’ within B—is 


VYijk = UG; sar ey + Cj) (7.10) 
imclids yaathien ta eee ihe’, 2,...,¢ 


where a; is the effect of factor A, b,,;) is the effect of factor B nested within A, and 
Cy(ij) 1s the effect of factor C’ nested within B. The subscript 7(z) refers to the factor 
represented by the 7 subscript nested within the factor represented by the 7 sub- 
script. The effects a;, b,;), and c,;;) are assumed to be independent random effects 
with means 0 and variances 0”, Oi(4)> and o(8)> respectively. 

Factor A is a fixed effect in the genetics study of Display 7.1 if only two sires 
are available for the study and the investigator wants to restrict the genetic results 
to those two sires. The dam effects are random if they represent six dams randomly 
selected from a potential population of dams. If factor A effects are fixed, then the 
random effect a; shown in Equation (7.10) is replaced by the fixed effect notation 
OX. 

Fixed effects for both factors A and B can occur in the study. Suppose a large 
metropolitan area fire department wants to evaluate the effect of two different fire 
station crew rotations on crew efficiency in its six fire districts. One rotation is 
tested in three of the districts chosen at random, and the other rotation is tested in 
the remaining three districts. A random sample of crews is tested in each district 
after the evaluation test period. The rotation factor, A, is fixed because only two 
types of crew rotations are under consideration. Likewise, the district factor, 2, is 
fixed because only the six districts occurring in the metropolitan area are under 
consideration. If factors A and B are fixed, then the random effects a; and b,,;) are 
replaced by fixed effects a; and (,;) in Equation (7.10). 


Analysis for Random Nested Factors 


The objectives of studies that utilize the random effects nested design depend on 
the subject matter of the study, but estimation of the components of variance and 
tests of hypotheses about the components are involved. The analysis of variance 
sums of squares partitions are used for both the estimation and testing procedures. 


246 CHAPTER7 FACTORIAL TREATMENT DESIGNS: RANDOM AND MIXED MODELS 


The computations for the analysis of variance are identical to those for subsampling 
as shown in Table 5.5. The expectations for the mean squares with all effects ran- 
dom are given in the abbreviated analysis of variance outline shown in Table 7.8. 


Table 7.8 Expected mean squares for the analysis of variance of a nested design with 
three random factors, A, B, and C’ 


Source of Degrees of Mean Expected 
Variation Freedom Square Mean Square 
Total abc — | 

A a — | MSA On) + COi/4) + bea? 
B within A a(b — 1) MS(B/A) ob) + COiva) 

C within B ab(c — 1) MS(C/B) ob) 


Point Estimators for Variance Components 


The analysis of variance estimates of the variance components are found by 
equating the observed mean squares to the expected mean squares and solving for 
the components of variance. The three estimators for the analysis in Table 7.8 are 


65) = MS(C/B) 
2 [MS(B/A)—- MS(C/B)| 


(  ——— (7.11) 


and 


~_, |MSA-— MS(B/A)| 
hire bc 
The null hypotheses of interest are Ho: 0? = 0 if the a; are random effects and 
He: — hy, =} =" = fh Or fixed efiecis with, fe 1 Statistic fy = 
MSA/MS(BIA). The hypothesis Ho: oj.) =0 is tested with [y= 
MS(B/A)/MS(C/B). 


Analysis of Glucose Quality Control Standards 


The analysis of variance for the glucose standards in Example 7.3 is shown in 
Table 7.9. Many computer programs will print a table of expected mean squares 
and variance component estimates upon request. The coefficient for the Day 
component (a2) is bc = 6, while the coefficient for the Run/Day component is 

mm 3: 
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Table 7.9 Analysis of variance for glucose quality control standards - 


Source of Degreesof Sumof Mean 

Variation Freedom Squares Square F Value ag sae 
Total 17 43.88 

Day 2 13.76 6.88 1.26 400 
Run/Day 3 16.36 5.45 4.75 02] 
Rep/Run 2 13.76 1:45 


The estimates for the components of variance are 


Reps: E26) = MS(Rep/Run) = 1.15 


33 MS(Run/Day) — MS(Rep/Run) (5.45 — 1.15) : 
Run: 9 b(a) = a ee — Ta, eas — i, =) 
MSDay — MS(Run/D 6.88 — 5.45 

a pe 


. bc 6 


The estimate of total variance for a glucose standard analysis is 


GC, = G7 + Gia) +24) = 0.24 + 1.43 + 1.15 = 2.82 
with standard deviation o, = 1.68. Approximately 9% of the variation is attributed 
to day-to-day variation, e: 51% to run-within-day variation, Gia); and 41% to 
replicates-within-runs variation, G20): The grand mean of the study was 7 = 
42.8, and the percent coefficient of variation for the glucose standards in this set of 
runs was %CV = (100)(6,/y_) = (100)(1.68/42.8) = 3.9%. 


Standard Errors for Means 


The variances for the grand mean for the study y and a day mean 7, , are 


o% F) , Pb(a) 
“a d of sweet! 4+ 7.12 
abc ab a es OT. bc 3 b ( ) 


respectively. The estimates are 


SOUR ay = 6.88 SS 
t ghes 2 


S 0.38 


and 
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ee MS(Run/Day) 5.45 


Yi. bc ee (2)(3) se 


The standard error estimates are sy = ,/0.38 = 0.62 andsz = ,/0.91 = 0.95. 
The variances, standard deviations, standard errors, coefficient of variation, 
and means are all useful statistics for quality control monitoring. Cumulative 
records of these statistics are often maintained in a laboratory and inspected regu- 
larly. If the values deviate from some established norms the analyst is alerted to the 
fact that the process must be investigated for the source or cause of the deviation. 


Tests of Hypotheses About Variances 
The test of the null hypothesis for the day component of variance is 


M SDay 6.88 ae 


ha MS(Run/Day) TAS 


and it is not significant with Pr > F = .400 (Table 7.9). The test of the null hy- 
pothesis for the run-within-day component of variance is 


MS(Run/Da 5.45 
( y) = —— =4,75 


o> Medepenny 1.15 


and it is significant with Pr > F = .021 (Table 7.9). These results indicate the 
analyst could most effectively increase precision by concentrating efforts to reduce 
the variation among runs on the same day and replicates within runs to obtain more 
precise estimates. 


Analysis with Fixed-Factor Effects 


The Fp ratios for tests-of hypotheses with fixed effects in the nested design can be 
determined from the expected mean squares shown in Table 7.10. The fixed effects 
are defined such that 4; a; = 0 and &1; 6,;) = 0 fora = 1,2,...,a. 


Table 7.10 Expected mean squares for the analysis of variance of a nested design 
with fixed and mixed effects for A and B 


Mean 

Square A and B Fixed A Fixed and B Random 
MSA O16) + beXia?/(a — 1) O16) + C054) + beXia?/(a — 1) 
MS(B/A) O21) + cd B+ )/alb — 1) 2b) + C04) 

MS(C/B) o-() o2() 


With A fixed and B random the null hypothesis for factor A effects, Ho: 
Q, = a2 =": = Aq, is tested with Fy = MSA/MS(B/A). When both factors 
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are fixed the null hypothesis for factor A_ effects is tested with 
Fo = MSA/MS(C/B). The null hypothesis for factor B effects with both 
factors fixed is Ho: Py) = B:) =--- = Po) for all 2, and it is tested with 
Fo = MS(B/A)/MS(C/B). 


Staggered Nested Designs to Equalize Information About Variances 


The nested factors design contains more information on factors at lower levels in 
the hierarchy of the design than at higher levels. In larger studies the discrepancies 
in degrees of freedom among sources of variation can be considerable. Staggered 
nested designs were developed to equalize the degrees of freedom for the mean 
squares at each level of the hierarchy. 

The staggered designs have unequal numbers of levels for factors that are 
nested within other factors. The levels for factor B nested within factor A vary 
from one level of factor. A to another in such a way that the degrees of freedom for 
MSA and MS(B/A) are more equal. Likewise, the levels for factor C’ nested 
within factor B can vary across levels of factor B to achieve degrees of freedom for 
M S(C/B) similar to that for the other mean squares. 

Anderson (1960) and Bainbridge (1963) gave some of the early results on the 
use of staggered designs. Smith and Beverly (1981) provide a general discussion on 
the use and analysis of staggered designs. Goldsmith and Gaylor (1970) enu- 
merated 61 staggered designs for three stages of factors (A, B, and C), such that all 
three variance components can be estimated by the analysis of variance method. 
The designs enumerated by Goldsmith and Gaylor included two or three of the five 
possible fundamental structures in a staggered design shown in Display 7.2. 


Display 7.2 Five Fundamental Structures for Three-Stage Staggered 
Nested Designs (Levels of B within A are b; and the levels 
of C within B are c;;) 


Structures 


The analysis of a three-stage staggered design follows the pattern established 
for the analysis with unequal subsamples and replications in Table 5.8. Leone et al. 
(1968) gave computational formulae for the four-stage nested designs with unequal 
levels at all stages. They provided formulae for the analysis of variance sums of 
squares, expected mean squares, and estimates of variances for the means at 
various stages of the design (yY_,Y;, ¥i;,, andY;,,.). Gates and Shiue (1962) 
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provided computational formulae for the analysis of variance sums of squares and 
expected mean squares for a general S-stage hierarchical classification. 


Example 7.4 A Staggered Design for Soil Samples 


The data in Exercise 5.8 were generated from a staggered nested design to 
estimate components of variance for characteristics of soil samples. The three 
factors in the design were fields (F’), sections within fields (S'), and locations 
within sections (L). Suppose a = 12 fields and b; = 2 sections per field are 
sampled. Suppose in fields 1 through 6 that c;; = 2 locations per section are 
sampled, and in fields 7 through 12 c;; = 1 location per section is sampled. 
This staggered design has six replications each of structures 1 and 3 from 
Display 7.2 as shown in Display 7.3. The design is one of those listed by 
Goldsmith and Gaylor (1970). 


Display 7.3 Staggered Nested Design for Soil Samples 


Structure 1 Structure 3 
Fields 1 through 6 Fields 7 through 12 


Field 


The analysis of variance outline for this design shown in Table 7.11 follows 
the format shown in Table 5.8 and utilizes Equation (5.23) to compute the coeffi- 
cients for the variance components in the expected mean squares. The near equal 
degrees of freedom for the mean squares provide near equal information on each 
source of variation. The variance components are estimated by the analysis of vari- 
ance method as 


MS(S/F)- MS(L/S 
By =MSL/S), Gq, = MES) ere 


and (7.13) 


a9 _ [1S0MSF ~1.48MS(S/F) — (1.50 — 1.48)MS(L/S)| 
5 ae (2.97)(1.50) 
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Table 7.11 Analysis of variance outline with expected mean squares for the 
staggered nested design 


Source of Degrees of Mean Expected 
Variation Freedom Square Mean Square 

Field aol = 1 MSF O21) - 1.4805.) + 2.970? 
Sections » bea 42 MS(S/F) o2) + 1.500% (4) 


Locations N- 
i 


b=12.  MS(L/S), 02, 


a 


7.4 Nested and Crossed Factors Designs 


Certain experimental conditions give rise to factorial arrangements that contain 
crossed and nested factors. In this case, some factors are crossed in the usual fac- 
torial arrangement of levels, whereas other factors are nested within the cells 
of the factorial arrangement or within levels of at least one other factor. These de- 
signs are sometimes called nested factorial designs (Anderson & McLean, 1974; 
Hicks, 1973; Smith & Beverly, 1981). 


Example 7.5 Spectrophotometer Evaluation 


Companies that manufacture machines and instruments have research and 
development departments to create new instruments or improve their current 
line of instruments. Performance tests are a regular part of the development 
stage of any new machine. The machines are tested for their mechanical or 
electrical functions, their accuracy in performing their designated function, 
and so forth. 


Research Problem: In one such setting a researcher was developing a new 
spectrophotometer for applications in medical laboratories. A model spectro- 
photometer had been constructed according to the proposed design and was 
ready for evaluation of its spectral capabilities in a laboratory setting. It was 
necessary to determine whether this particular design determined spectral 
properties over the required range of serum glucose standards. The researcher 
had to determine whether the variability and consistency of results over mul- 
tiple runs and days were within the required specifications. 


Treatment Design: A factorial treatment design was used with 
“concentrations” of glucose and “days” as factors. The serum samples were 
enhanced with three different levels of glucose to cover the range of glucose 
concentrations the instrument should be able to analyze. All three concen- 
trations were analyzed on each day, so that concentrations were crossed with 
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days in a3 x 3 factorial arrangement. Two runs of the instrument were made 
on each day, so that runs were nested within each day. 


Experiment Design: Four replicate serum samples were prepared for each of 
the three concentrations of the glucose standards each day. Two samples of 
each concentration were assigned randomly to each run of each day. The six 
samples were analyzed in random order on each run. The same technician 
prepared the samples and operated the instrument throughout the experiment. 

The design had nested and crossed factors with a = 3 concentrations 
crossed with b = 3 days with c =2 runs nested within each day and r = 2 
replicate serum sample preparations for each concentration in each run. The 
glucose concentrations observed on the spectrophotometer are shown in 
Table 7.12. 


Table 7.12 Glucose concentrations (mg/dl) for three standard concentration 
samples from two runs of a spectrophotometer on each of three days 


Day | Day 2 Day 3 
Concen. Runl Run2 Run3 Run4 : Run5 Run6 
] 41.2 41.2 39.8 41.5 41.9 45.5 
42.6 41.4 40.3 43.0 42.7 44.7 
2 | ee eee 132.4 134.4 sia (44tA 
136.8 143.3 1305 - 1300 ta02 4393 
3 165.2. [614 [73.6 Fas 66.6 173.0 
163.3 180.3 195d 22 ATSB 165:5. 1720 


Source: Dr. J. Anderson, Beckman Instruments, Inc. 


Statistical Model for Nested and Crossed Factors 


The statistical model for this particular experiment is 


Yijkl = + 04 +b; + cyj) + (ad)ij + (AC)in(7) + Cage (7.14) 
1ST 6 pS Pa SE eee FS 1,Z,...,7 


where aq; is the fixed effect for Concentration, b; is the random effect for Day, cx,j) 
is the random effect for Runs nested within Day, (ab);; is the random effect for 
Concentration by Day interaction, (ac);,,;) is the random effect for Concentration 
by Run interaction nested within Day, and e; jx; is the random experimental error. 
The assumptions for the effects are consistent with those indicated in previous sec- 
tions on random, mixed, and nested models. 

The model effects for the two crossed factors, Concentration and Day, follow 
the usual convention with main effects and interaction. The model effect for Runs 
within Day follows the usual convention for one factor nested within another. The 
interaction effect for Concentration by Run nested within Day is the one new fea- 
ture in the model. Since each concentration is evaluated on each run the two factors 
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constitute a complete factorial arrangement on each day. Therefore, the Concentra- 
tion by Run interaction can be evaluated on each day and can be nested or pooled 
over days since the Runs nested within Day are unique runs on each day. 


Expected Mean Squares 


The model shown in Equation (7.14) is a mixed model with one of the crossed 
factors random and the other fixed. The expected mean squares for the analysis of 
variance are affected by the model assumed for the crossed factors and the manner 
by which the other factors are nested in the experiment. Consequently, a variety of 
expected mean square patterns are possible. The expected mean squares for models 
with A and B fixed or with A fixed and B random are shown in Table 7.13. 


Table 7.13. Expected mean squares for a nested factorial with A and B crossed 


and C’ nested within B 
: Expected Mean Square 

Source of — Degrees of 
Variation Freedom A Fixed, B and C Random A and B Fixed, C Random 
Total abcr — | 
A G=— Te ep 107 4, + hoy + bere, Oo? +7074) + ber Os 
B b—1 o7+ ro" c(b) fe ara) a Cro", + acro; of 10“ 0(b) + aroi) sg acr6? 
AB (a—1)(b-—1) o7+ TO Ac(b) + cro?, go? + 07a) FeP0gs 
C/B b(c—1) o*+ TO a0(b) + arazy) il LLP aor ara) 
AC/B b(a—1)(ce-—1) o7+ TO a0(b) o7 + TO“ 0b) 
Error abc(r—1) 0° o? 


Degrees of Freedom 


The degrees of freedom for the sources of variation in the analysis of variance 
follow the usual conventions for the crossed factors, A and B (Section 6.4). The 
degrees of freedom for C’ nested within B follow the convention for one factor 
nested within another (Section 7.3). The degrees of freedom for the AC interaction 
nested within 6 follows the nesting degrees of freedom convention. Only in this 

_ case the interaction measured at each level of B has (a — 1)(c — 1) degrees of free- 
dom; therefore, when nested over b levels of B there are b(a — 1)(c — |) degrees of 
freedom. The degrees of freedom for error are the degrees of freedom for each cell, 
r — 1, pooled over the abc cells in the experiment. 


Analysis for Nested and Crossed Factors 


The analysis of variance for the observations from the spectrophotometer evalua- 
tion of Example 7.5 is shown in Table 7.14. Four different mean squares are re- 
quired as denominators for the Fo statistics to test hypotheses about the effects or 
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variance components in the model with one fixed factor and two random factors 
(see Tables 7.13 and 7.14). The mean square for Error is the denominator of Fo to 
test concentration x runs within day, CR/D. The mean square for CR/D is the 
denominator of Fp required to test runs within day, R/D, and concentration x day 
interaction, CD. The mean square CD is the denominator of Fo required to test dif- 
ferences among the concentration means, C. Synthesized mean squares for Fo re- 
quired to test day variation, D, are 


MSN = MS(D)+ MS(CR/D) = 42.4 
for the numerator and 
MS = MS(CD)+ MS(R/D) = 131.8 


for the denominator. Obviously, Fp = MSN /MS = 42.4/131.8 = 0.32 will not 
be significant. Many programs automatically use the mean square listed for experi- 
mental error for all of the Fp statistics, which is not always the correct denominator 
mean square. Those programs must be given special instructions to compute the 
correct Fp statistics if they have the capability. 


Table 7.14 Analysis of variance for spectrophotometer glucose measurements 
from a factorial with nested and crossed factors 


Source of! Degrees of Sum of Mean 

Variation Freedom Squares Squares F feo F 
Total 35 108,934.1 

2 108,263.6 54,131.8 1,227.48 .000 
D Z 24.9 12.4 * * 
CD 4 176.4 44.1 1.47 el 
R/D 3 263.1 87.7 222 122 
CR/D 6 180.2 30.0 21.43 .000 
Error 18 25.8 1.4 


*Test with synthesized Mean Square 
*C = Concentration, D = Day, R/D = Run nested in Days 


As expected the concentration differences were significant, Fy = 1227.48 with 
Pr>F=.000. The concentration x day interaction was not significant, 
Pr > F = .321, which indicates relatively consistent performance of the instru- 
ment from day to day with respect to concentration measurements. However, 
concentration x run within day was significant, Fp = 21.43 and Pr > F = .000. 
Runs within days with Fp = 2.92 and Pr > F = .122 was not significant. The run- 
to-run consistency of the instrument across concentrations requires some inspec- 
tion. The inconsistency could be due to the operation of the instrument or the lack 
of consistency in preparation of the samples for each of the concentrations from run 
to run. 
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7.5 How Many Replications? 


Random Models 


Replication numbers to detect desired significant contributions from a component 
of variance require a value for the \ constant where the Fo statistic has the central 
F,,,. distribution multiplied by A? (see Section 5.8). The value for \? may be 
evaluated in general as follows. Let Fo be the ratio of mean squares Fo 
= MSN/MSD, where MSN and MSD, respectively, designate the mean 
squares for the numerator and denominator of Fo. The constant ? is the ratio of the 
expected mean squares, or \? = E(MSN)/E(MSD). Technical details may be 
found in Graybill (1961). The charts in Appendix Table X are used as described in 
Section 5.8. 

Consider the two-factor random effects model in Section 7.1. A test of Hp: 
o? = 0 requires Fy = MS. A/MS(AB), so that 


E(MSA - a = 
ee ( ) ee Ie aa ee maar (7.15) 
E|MS(AB)| o? +ro-, o? + ro“, 
Mixed Models 


The detection of prescribed fixed-factor effects, say factor A, for the two-factor 
mixed-model experiment requires 


a ee 
ee 7.16 
a(o? + ra2,) oss 
for the charts in Appendix Table IX for fixed effects. The value of the constant A 
for tests about a? and o?, can be determined as shown for the random model. 


7.6 Expected Mean Square Rules 


The rules for expected mean squares given in this section apply to most balanced 
designs with equal replication numbers. The number of levels for any factor do not 
vary within the balanced designs. The designs include crossed factorials, nested 
factorials, and mixtures of crossed and nested factors. The rules are adapted from 
those given in various publications, including Bennett and Franklin (1954) and 
Mason, Gunst, and Hess (1989). Many computer programs have commands to pro- 
duce expected mean squares for an analysis of variance. 


Rules Illustrated with the Unrestricted Mixed Model 


The rules are exemplified with a two-factor mixed-model experiment, A fixed and 
B random, with r replications of each treatment combination. 
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1. Write out the linear model for the design: 
Vig = pet ag +b; + (eb); Peis 
perl de oe Le, SE 


Note the replication subscript k is nested within the zjth treatment 
combination. 


2. Construct a two-way table with 
(a) a row for each term in the model, excluding py, labeled with the model 
term and 
(b) acolumn for each subscript used in the model. 


3. Over each column subscript write the number of factor levels for the subscript 
and write “R” if the factor is random and “F” if the factor is fixed. 


4. Add another column with entries as the appropriate fixed or random variance 
component for the effect represented by that row in the table. 


F R R 

a b ¢ 
Source a 7 k; Component 
A Ol; 62 
B b; O : 
AB (ab); j Oo a 
Error Ck(i3) o? 


5. For each row, if the column subscript does not appear in the row effect, enter 
the number of levels corresponding to the subscript. 


F R R 

a b r 
Source 1 4 k Component 
As Qj b r 62 
B b; a r o? 
AB (ab); j f Oo os 
Error €k(i7) eo 


6. If a subscript is bracketed in a row effect, place a 1 in cells under those 
subscripts that are inside the brackets. 
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R R 
a b if 
Source a 9 k; Component 
A QQ; b ‘Z 6? 
B b; a ‘i a 
AB (ab);; 1 a, 
Error €k(ij) ] l o? 


(a) For each row, if any row subscript matches the column subscript, enter a 0 
if the column represents a fixed factor F and there is a fixed component of 
variance for the effect represented by the row. 

(b) Enter a | in the remaining cells. 


F R 

a b r 
Source a a I; Component 
A Q; 0 b r 62 
B b; a | r oF 
AB (ab); j l | Tr o. - 
Error €k(ij) ] ] l o? 


To determine the expected mean square for a specific source of variation: 

(a) Include co? with a coefficient of 1 in all expected mean squares. 

(b) Of the remaining variance components include only those whose corre- 
sponding model terms include the subscripts of the effect under con- 
sideration. For E(M SB) the b; effect, include o?, and o in addition to 0”. 

(c) Cover the columns containing non-bracketed subscripts for the effect 
under consideration. For a; cover 2 and for e;;;;) cover k. 

(d) The coefficient for each component in the E(/'S) is the product of the re- 
maining columns of the row for that effect. For EUMSB) the column with 
7 is covered so that only the values in columns / and k are visible. For the 
(ab) row the visible values are | and r so that the coefficient for 0%, is 
1-r=r. For the 5; row the visible values are a and r so that the coeffi- 
cient for 07 is a-r. 


F R R 

a b , 
Source a q k; Component E(MS) 
A ngs x) b r g° o? + ro?, + bré? 
B b; a l r 0; a? + ro?, + aro; 
AB (ab);; l 1 r ope a? + ro?, 
Error (5) l 1 l a” eas 
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Illustration: The complete table for expected mean square determination of a 
mixed-model factorial with crossed and nested factors follows. Factors A and B are 
fixed and crossed, and factor C’ is random and nested within B across A. The 


model is 
Yigkt = + 0G + Bj + (AB)ij + Cuz) + (aC)in(5) + excazk) 

ae ee ee 

aie. aa aes 
Source i. 9- kh... 1. Component E(MS) 
A C2 Don « Crake @? O° + 70774) + ber 0; 
B Bj sae oBrpe: ip 0; O° +7074) + Aros) + acrOF 
AB lap): O Fee a o7 + 10 ae(b) 5 cre: 
C/B Cn gy @ bebe O21) 2 +1704) + Aro) 
(ACY (elgg 1 41 ae b) FIL 
Error El(ijk) l 1 | 1 go a? 


Alteration of the Rules for Restricted Mixed Models 


One alteration of the method in Step 7 is required if the restricted mixed model is 
used—that is, the model in which the interaction effects are correlated and summa- 
tion of interaction effects over the fixed effects subscripts is restricted to zero. Step 
7 (a) will be “For each row, if any row subscript matches the column subscript, 
enter a 0 if the column subscript represents a fixed factor F’.” Step 7 (b) remains 
unchanged. 


Other Estimation Methods for Variance Components 


Only analysis of variance estimators have been considered for estimation of vari- 
ance components in Chapters 5 and 7. Estimation of variance components by the 
analysis of variance method is relatively straightforward with balanced data (that is, 
all data cells contain the same number of observations). Estimation of variance 
components is much more difficult when data are unbalanced. The first major 
breakthrough for variance component estimation was made by Henderson (1953), 
who presented three different adaptations of the analysis of variance method for 
estimating variance components with unbalanced data and random or mixed 
models. 

Since that time, other methods of estimation have been developed, including 
maximum likelihood (ML) estimation by Hartley and Rao (1967) and a modi- 
fication of maximum likelihood known as restricted maximum likelihood (REML) 
attributed to work by Thompson (1962) and Patterson and Thompson (1971). The 
MINQUE method to find minimum variance quadratic unbiased estimators can be 
traced to a variety of authors. These methods of estimation are usually available in 
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many of the more comprehensive statistical packages. A comprehensive treatment 
of variance component models and estimation can be found in Searle et al. (1992). 


a 


EXERCISES FOR CHAPTER 7 


a 


1. Cholesterol was measured in the serum samples of five randomly selected patients from a pool of 
patients. Two independent replicate tubes were prepared for each patient for each of four runs on a 
spectrophotometer. The objective of the study was to determine whether the relative cholesterol 
measurements for patients were consistent from run to run in the clinic. The data are mg/dl of 
cholesterol in the replicate samples from each patient on each run. 


Patient 
Run ] 2 3 4 3S 
] 167.3 186.7 100.0 2144.5 148.5 
166.7 ~1842°° 1079 215.3 “148.5 
Zz 1796 193%" Tile 226.9 “1565 
175.3. 198.9 1144 220.4 154.7 
4 169.4 179.4 105.9 208.2 144.7 
1659 1776. O41. 207.1 - 143.9 
4 177-1 F904 FISA. 2288 - 136-1 
P74 102 4 = ae ee 


Source: Dr. J. Anderson, Beckman Instruments, Inc. 


a. Write a linear model for the experiment assuming patients and runs are random effects, explain 

the terms, and conduct an analysis of variance for the data. 

Show the expected mean squares for the analysis of variance. 

Estimate the components of variance for runs, patients, and interaction. 

d. State the null and alternate hypotheses for main effects and interaction, test each of the null 
hypotheses, and interpret your results. 


ee 


2. An animal scientist conducted an experiment to study the effect of water quality on feedlot per- 
formance of steer calves. Four water quality treatments were used for the experiment. The water 
sources were designated as normal (N) and saline (S). The saline water was formulated to approxi- 
mate the mineral concentrations in some underground water sources utilized in practice for water- 
ing livestock. Four combinations of water used in two consecutive 56-day periods of the 
experiment were N-N, N-S, S-N, and S-S. The feeding trial consisted of the four water treatments 
with two replicate pens of animals for each treatment in a completely randomized design. The trial 
was conducted on two separate occasions (two consecutive summers). The resulting design is a 
factorial arrangement of four water treatments and two summers. The water treatments are con- 
sidered fixed effects and summers are considered random effects, so that a mixed model is appro- 
priate for the study. The data are the average daily gains for the 16 pens of steers. 
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foe oe 


Water 
_ Summer N-N N-S~ S-N — S-S 
| £00. °2e8 - 7356 = 243 
Didi dO----2,38----2,50 
2 ao 1 ae Ie 


Be gt ee 


Source: Dr. D. Ray, Department of Animal Sciences, 
University of Arizona. 


Write a linear model for the experiment, explain the terms, and conduct an analysis of variance 
for the data. 

Prepare a table of cell and marginal means and their respective standard errors. 

Show the expected mean squares for the analysis of variance. 

Test the null hypotheses for main effects and interaction, and interpret your results. 

The water treatments have a 2 x 2 factorial arrangement. The first factor (A) is normal or 
saline water in the first 56-day period, and the second factor (B) is normal or saline water in 
the second 56-day period. Write the linear model for the experiment with this arrangement, 
considering summers as random effects and factors A and B as fixed effects. Repeat parts (a) 
through (d) of this exercise with the new model. 


Three formulations of an alloy were prepared with four separate castings for each formulation. 
Two bars from each casting were tested for tensile strength. The data are tensile strengths of the 
individual bars. There are four castings nested within each alloy. 


ale 


e. 


Castings 

Alloys 1 2 Es 4 
A 9.209035 2 PRS 14.6 
oe a 

B iT “ee ee 
PO.) °*t7 0 toe 10,6 

C mt ipa ae 8 
48 158 ya 


Write a linear model for the experiment, assuming alloys as fixed effects and castings within 
alloys and bars within castings as random effects. Explain the terms, and compute the analysis 
of variance. 

Show the expected mean squares for the analysis. 

Test the null hypothesis for alloy effects, and interpret your results. 

Compute the estimated means and the 95% confidence interval estimates for the means of each 
alloy. 

Estimate the components of variance for castings and bars. 


A traffic engineering study was conducted to evaluate the effects of three traffic signal types on 
traffic delay at intersections. The study was also designed to evaluate two methods for measuring 
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traffic delay. The three signal types were pre-timed, semi-actuated, and fully actuated signals. The 
two methods, point-sample and path-trace, estimated stopped time per vehicle at an intersection. 

Two intersections were used for each signal type. Measurements were made during rush hour 
and nonrush hour periods. The three crossed factors in the study were signal type, method, and 
time of day (rush hour and nonrush hour). The intersections were nested within signal types but 
crossed with method and time of day since both methods were used on the same intersection during 
both times of day. The data were traffic delay measured as seconds per vehicle. 


Point-Sample Path-Trace 
Signal Intersection Rush Nonrush Rush Nonrush 

Pretimed l 61.7 57.4 53 to236S 
¢ A086 ee x 15.9 

Semi-actuated 3 20.0 24.6 PFD 27H 
4 val | ot ite, 1.1 

Fully actuated 5 30.7. eae Soa 20a 
6 Fr, Ge Bi i ic Rais sti ke 


Source: W. Reilly, C. Gardner, J. Kell (1976), A technique for measurement of delay 
at intersections. Technical Report, FHWA-RD-—76-135, Federal Highway 
Administration, Office of R&D, Washington, D.C. 


a. Consider only one method of measuring stopped delay (point-sample or path-trace). Write a 
linear model for the study, assuming signal type and time of day are fixed effects. Sketch the 
analysis of variance table, including source of variation, degrees of freedom, and expected 
mean squares. 

b. Suppose you suspect an intersection x time-of-day interaction. Will you be able to test a null 
hypothesis about the interaction? 

c. Suppose there is an interaction between intersection and time of day. Which hypotheses can 
you test from the analysis of variance? 

d. Compute the analysis of variance for the point-sample data. State your assumptions about the 
model, and test the hypotheses that can be tested with your stated model. 

e. Now suppose you want to include the factor for method of measurement into the analysis— 
that is, point-sample versus path-trace as a fixed effect factor. Write a linear model for the 
analysis, and sketch the analysis of variance table, including sources of variation, degrees of 
freedom, and expected mean squares. 

f. What assumptions are necessary about the interaction between intersection and the other 
factors for you to test some hypotheses from the analysis of variance? 

g. Compute the analysis of variance for the entire data set. State your assumptions about the 
model, and test the hypotheses that can be tested about model effects. 


An experiment was conducted to compare the accuracy of two mass spectrometers in measuring 
the ratio of ‘4N to !°N. Two soil samples were taken from each of three plots of land treated with 
1°N|. Two subsamples of each sample were analyzed on each of the two machines. The resulting 
design has machines crossed with plots and samples. However, the samples are nested within plots. 
The data are ratios of !“N to !°N (after multiplying by 1000). 
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Plot ] vs 3 
Sample 1 a 3 4 > 6 
Machine A 3.833 3.819 S150, 9 Bod 3. Lay: 2.429 
3.866 3.853 FA Bien ees 3.220... 108 
Machine B_ 3.932 3.884 2632 23.914 Jk Pico, Behe 
3.943 3.888 3.829 ,,3.915 3.444 3A%827 


Source: D. Robinson (1987), Estimation and use of variance components. 7he 
Statistician 36, 3-14. 


a. Write a linear model for the experiment, assuming machines with fixed effects and plots and 
samples with random effects; explain the terms; and compute the analysis of variance for the 
data. 

b. Show the expected mean squares. 

c. Test the null hypothesis of no difference between the means for the two machines. 


6. Use the rules given in Section 7.6 to derive the expected mean squares for the following studies or 
models: 

the cholesterol study in Exercise 7.1 

the cattle-feeding trial in Exercise 7.2 

the alloy casting experiment in Exercise 7.3 

the traffic study in Exercise 7.4 

the soils study in Exercise 7.5 

a four-stage nested design with the model 


moaogs 


Visa = ph +a; +b dy cag aiazK) 
peel 4 je A eS 
g. amodel with nested and crossed factors written as 


Yiskim = b+ 04 + Bj + (aB)iz + Ceaizy + 61 + (@O)a + (G0); 
+ (a86)ij1 + (Cd) Ki(43) + em(izxt) 
bet 238? FH lek cP Oe ee 


where a, 3;, 5,, and their interactions are fixed effects and c,,;;), (cd)x1(i;), ANA Em(ijx1) are 
random effects 


‘2 How would your statistical inference change if the model with restrictions on the interactions 
had been used for Example 7.5? 


G Complete Block Designs 


Experiment designs to improve the precision of results from research studies are 
the topics of discussion in this chapter and others to follow. Blocking was introduced 
in Chapter 1 as a method to reduce experimental error variation. Blocking groups 
the experimental units into homogeneous blocks to compare treatments within a 
more uniform environment. The designs in this chapter use either one grouping cri- 
terion in a randomized complete block design or two grouping criteria in Latin 
Square arrangements. The features, randomization, analysis, and evaluation of 
these designs are discussed. Extensions of the designs include factorial treatment 
designs, multiple experimental units per treatment in each block, and subsampling. 
Conducting analysis when some observations are missing is discussed. The topic 
for the final section in this chapter is combining the results from several repetitions 
of the same experiment at several places or several times. 


8.1 Blocking to Increase Precision 


Our objective is to have precise comparisons among treatments in our research 
studies. Blocking is a means to reduce and control experimental error variance to 
achieve more precision. 

Previous chapters concentrated on treatment designs and the associated statis- 
tical methods for efficient analysis of research hypotheses. All of the illustrations 
utilized completely randomized designs. However, outside of appropriate exper- 
imental unit selection and good research techniques, the completely randomized de- 
signs provide no control over experimental error variance. The experimental units 
are assumed to be relatively homogeneous with respect to the measured response 
variable in completely randomized designs. However, sometimes sufficient num- 
bers of homogeneous units do not exist for a complete experiment with these 
designs. 
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Any factor that affects the response variable and that varies among the expe 
mental units will increase the experimental | error variance and decrease the preci: 
sion of the experimental results. Factors such as age or weight of animals, different 
batches of reagents or manufactured material, gender of human subjects, and physi- 
cal separation of field plot locations are examples of variables external to the treat- 
ments that can increase the variation among observations on the response variable. 

Blocking stratifies experimental units into homogeneous groups, or like units. 
A successful choice of blocking criteria results in less variation among the units 
within the blocks than that among units from different blocks. General pee of 


Spores 

A group of neighboring field plots forms a block in agronomic field experi- 
ments. Animals grouped by weight, stage of lactation, or litter form blocks of 
homogeneous experimental units. The engineer uses a single batch of manufactured 
material to form a block or homogeneous group of experimental units for the treat- 
ments. Laboratory experiments use technicians as a blocking factor to eliminate 
variation among technicians. Each technician performs one replication of the treat- 
ments as a block. 


8.2 Randomized Complete Block Designs Use One Blocking 
Criterion 


The randomized complete block design is the simplest of the blocking designs used 
to control and reduce experimental error. The experimental units are stratified into 
blocks of homogeneous units. Each treatment is assigned randomly to an equal 
number (usually one) of experimental units in each block. More precise compari- 
sons are possible among treatments within the homogeneous set of experimental 
units in a block. Blocking turned out to be very beneficial in the following study. 


Example 8.1 Timing of Nitrogen Fertilization for Wheat 


Current nitrogen fertilization recommendations for wheat included applica- 
tions of specified amounts at specified stages of plant growth. The recom- 
mendations were developed through the use of periodic stem tissue analysis 
for nitrate content of the plant. Stem tissue analysis was thought to be an 
effective means to monitor the nitrogen status of the crop and provide a basis 
for predicting required nitrogen for optimum production. 


Research Objective: In certain situations, however, the stem nitrate tests 
were found to overpredict the required nitrogen amounts. Consequently, the 
researcher wanted to evaluate the effect of several different fertilization tim- 
ing schedules on the stem tissue nitrate amounts and wheat production to re- 
fine the recommendation procedure. 
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Treatment Design: The treatment design included six different nitrogen 
application timing and rate schedules that were thought to provide the range 
of conditions necessary to evaluate the process. For comparison, a control 
treatment of no nitrogen was included as was the current standard 
recommendation. 


Experiment Design: The experiment was conducted in an irrigated field with 
a water gradient along one direction of the experimental plot area as a result 


ec ae Pe lees: bie %, hice ; & od 
" Ve 2 7 R a e x » 
SDOnSES dl ware | Ka rLEGa DY. : 


of irrigation. Since plant 


3 e2 wate |. Thus, 


any differences in plant responses caused by the water gradient could be asso- 
ciated with the blocks. The resulting experiment « 


en treatments were randomly allocated. 

- The layout of the experimental plots in the field is shown in Display 8.1. 
The observed nitrate nitrogen content (ppm x 1077) from a sample of wheat 
stems is shown for each plot along with the treatment numbers, which appear 


in the small box of each plot. 


wet 
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Display 8.1 Arrangement of Experimental Plots for the Wheat 
Experiment in a Randomized Complete Block Design 


Irrigation 
Gradient 
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Source: Dr. T. Doerge, Department of Soil and Water Science, University of Arizona. 


How to Randomize the Design 


The ran 


y 
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dom allocation of treatments to experimental units is restricted in the ran- 
ch tr 


eatment must occur an equal 
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number of times (one or more) within each block. Randomization is illustrated with 
the wheat experiment of Example 8.1. 

A random permutation of the order in which the treatments are placed with the 
units in each block provides a random allocation of treatments to units. There are 
6! = 720 possible permutations of the six treatments. One permutation is randomly 
selected for each block since a separate randomization is required for each block. 

Assign the treatment label, such as A, B, C, D, E, F, to the respective integer 
values 1, 2, 3, 4, 5, 6. Obtain a random permutation of the integer values from a 
computer program or a table of permutations found in Appendix Table XIII. One 
such permutation is shown in Display 8.2. Given the permutation 2, 5, 4, 1, 6, 3, 
assign treatment B to unit 1, treatment E to unit 2, treatment D to unit 3, and so 
forth in block 1. 


Display 8.2 Assignment of Treatments to Experimental Units 
in a Complete Block 


Permutation 
Treatment B E A F 
Experimental unit 


Separate random permutations are required for each of the remaining three 
blocks. Given additional random permutations, say (1,3,4,6,5,2), (6,3, 5, 1,2, 4), 
and (2,4, 6,5,3, 1), the final assignment of treatments to units within sho block | is 
shown in Figure 8.1. 


sates Model and Analysis for Randomized Complete Block Design 


experiment in a randomized complete block design re- 


, LHI dI | IU 
= oem et 25 


quires a ae fapreselst the variation identifiable in the Sbacrrations as a conse- 


uence of blocking. The response of the unit with the ith treatment in the jth block 
is written as 


(8.1) 


(Ser ee howe 


where yp is the general mean, 7; is the treatment effect, and e;; is the experimental 
error. The block effect t p; represents the average deviation of the units in block 7 
er SDE SES: 

from the ‘general m mean. The treatment and block effects are assumed to be additive. 
Additivity means there is no interaction between treatments and blocks. The experi- 
mental errors are assumed independent with zero means and common variance o”. 
The independence assumption is justified through random assignment of treat- 
ments to the experimental units. 
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Irrigation 
Gradient 


| 


Figure 8.1 Randomized assignment of treatments in a randomized complete 
block design 


Blocks Add a Sum of Squares Partition to the Analysis of Variance 


The basic data table for a randomized complete block design is shown in Table 
8.1. The deviation of any observation from the grand mean in Table 8.1, y;; — 7, 
may be written as the algebraic identi 


ty cali 


ee Nae aye ay ae ee ee ae ae See eet oe 
ee aE See ee pe ec 
if ‘ ' | ae 2 
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Table 8.1 Data table for a randomized complete block design 


Block Treatment 
Treatment 1 Z = a Means 
] — Yi2 ae Yir 1. 
2 Yo Yy22 =e Yor Yo, 
t Ytl Yi2 eae Ytr Ut. 
Block means Y, Y9 ve Yee Y. 


The terms on the right-hand side of Equation (8.2) are 


e atreatment deviation (y; — 7 ) 
e a block deviation (y ; — ¥.) 
e experimental error (y;; — Y;, — 9; + J.) 


For example, the means for the wheat experiment in Example 8.1 (shown in Table 
8.3) are used to illustrate the deviations for treatment 1 in block 2, yy2, as 
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e the treatment | deviation: y, — 7, = 38.28 — 42.07 = — 3.79 
e the block 2 deviation: 7. — 7. = 45.89 — 42.07 = 3.82 

e the experimental error for y;9: 

yo — Hy, —Bot¥. = 41.22 — 38.28 — 45.89 + 42.07 = — 0.88 


and the sum of the three deviations, — 3.79 + 3.82 — 0.88 = — 0.85, is the same 
as the total deviation y;2 — y_ = 41.22 — 42.07 = — 0.85, as it should be. 

On closer inspection the last two terms of Equation (8.2) compose an algebraic 
identity for the deviation of the observation from the treatment mean 


Y= SH HS a + Og a FD (8.3) 


e experimental error deviation from the completely randomized design, 
v= Gy, _ is ational me two components. The first term is identified with the 
blocking criteria as 7; — 7. The second term is identified only as a residual or 
experimental error, yj; — Y;, — Yj + Y... 

Squaring and summing both sides of Equation (8.2) results in 


t 


Tr t T 
SS Ge -BY =) GG. -— 9) +t > G,-7y 
| i=1 s=1 


i=1 j=l 


Tr 


+ (yij —F;. —V, +9. (8.4) 


t 
a=] j=] 


Or 
SS Total = SS Treatment + SS Blocks + S'S Error 


The sum of any crossproducts from the right-hand side 1 is zero. Table 8.2 summa- 
rizes the sum of squares partition. 


Table 8.2 Analysis of variance for an experiment in a randomized complete block 


design 
Source of Degrees of Sum of Mean Expected 
Variation Freedom Squares Square Mean Square 
Total ti igo y 
tj 

Blocks pad aE Hayy MSB 

j 
Treatments t—1 roy, —y.) MST o* + ré? 

t 


Error (r—1)\t¢—1) SS Error MSE o? 
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The observations on stem nitrate nitrogen along with treatment and block 
means for the wheat fertilization study are shown in Table 8.3. The analysis of vari- 
ance for the stem nitrate data is shown in Table 8.4. 


Table 8.3 Nitrate nitrogen in wheat plant stems (ppm x 10~7) for six nitrogen- 
timing treatments in each of four blocks of a randomized complete block design 


Nitrogen Block Treatment 
Timing Schedule ] 2 3 4 Means (Y; ) 

Control 34.98 41.22 36.94 39.97 38.28 

2 40.89 46.69 46.65 41.90 44.03 

3 42.07 49.42 52.68 42.91 46.77 

4 37.18 45.85 40.23 39.20 40.62 

5 37.99 41.99 37.61 40.45 39.51 

6 34.89 i a oe, 44.57 43,29 43.23 


Block Means (Y ;) 


Table 8.4 Analysis of variance for wheat stem nitrate nitrogen 


Source of — Degrees of Sum of Mean 

Variation Freedom Squares Square F freee 
Total Zo 506.33 

Blocks 3 197.00 65.67 9.12 

Nitrogen 5 201.32 40.26 5.59 .004 


Error 15 108.01 hu) 


As a consequence of blocking, a sum of squares for blocks is partitioned out of 
what would have been the sum of squares for experimental error with the complete- 
ly randomized design. The blocked design will markedly improve the precision on 
the estimates of a treatment means if the reduction in SS Error with blocking is 

The reduction in SS - vt a be wows ae a ae in degrees of 
freedom, since r- =i f the degrees of freedom must be allocated to S'S Blocks. A 
measure of ne eorrey. ane later j in disk pomeree is ~ nero to evaluate 
the full benefit of blocking. 


Standard Errors for Treatment Means 


The standard error estimate for a treatment mean is 


7.20 


/MSE _ 
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and a 95% confidence interval estimate of any treatment mean in Table 8.3 is 


Y;, = t.o25,15(Sz, ), where to2515 = 2.131. The standard error of a difference be- 
tween any two treatment means is estimated by 


a [2MSE 2(7.20) 
teal meer at Rl (8.6) 


Tests of Hypothesis About Treatment Means 


The Fo statistic to test the null hypothesis of no differences among the treat- 

ment means is 
ewpr 40.26 
ee 720 °° nee 

which exceeds the critical value of F’o5,5,15 = 2.90. The observed significance level 
is Pr > F = .004 (Table 8.4). There are significant differences among the nitrogen 
treatments with respect to stem nitrate nitrogen at this stage of plant development. 

There is little interest in formal inferences about block effects, and the Fo sta- 
tistic generally is not computed for this purpose even though it may appear in the 
output of a computer program. The extent to which blocking increased the efficien- 
cy of the design to utilize existing resources is discussed later in this section. 


Interpretations with Multiple Comparisons 


Schedule 4 was the standard fertilizer recommendation for wheat. The nitrate 
nitrogen in the stem of the wheat plant measured throughout the growing season is 
used to assess nitrogen requirements for optimum wheat yields. The investigator 
would be interested in differences between any of the individual nitrogen-timing 
treatments and the current recommendation at each stage of growth. The Dunnett 
method (Chapter 3) can be used to compare the standard recommendation with 
each of the other timing treatments, including the no-nitrogen control treatment. 
The no-nitrogen control provides a means of evaluating the nitrogen available with- 
out fertilization in these particular plots. (See Chapter 1 on the use of control 
treatments.) 

The Dunnett 95% simultaneous confidence intervals require the standard error 
of the difference, s(y; — y,;) = 1.90, and the Dunnett statistic from Appendix 
Table VI, do55,15 = 2.82, for a two-sided comparison. 

The 95% SCI for the difference between the mean of any other schedule for 
nitrogen application and schedule 4 requires D(5,.05) = do5515|s(Y;. — Y4.)| = 
2.82(1.90) = 5.36 and are computed asy; — y, £5.36. The intervals are given in 
Table 8.5, along with the results of the confident inequalities test in which the abso- 
lute difference is declared significantly different from 0 if the difference exceeds 
D(5, .05) = 5.36. Schedule 3 is the only treatment that has a mean nitrate nitrogen 
level significantly different from the current recommended schedule 4. It has a ni- 
trate content greater than that of schedule 4 since the lower bound of the SCI is 
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greater than 0. The SCI for all other treatment comparisons include 0 and have 
upper and lower bounds considerably removed from 0. 


Table 8.5 Results of the Dunnett method for treatments vs. control (Example 8.1) 


95% SCI 
Schedule Mean uy; = Ge 3 
4 7, = 40.62 re eS 
| 38.28 se ae (7-10. 3:02) 
2 44.03 3.41 (105.877) 
3 46.77 6.15* (0.79, 11.51) 
5 39.51 essed i (5 647, 4425) 
6 43.23 2.61 (= 2.75, 7.97) 


“|, — 9, | exceeds D(5, .05) = 5.36 and is significantly different from schedule 4. 


If Treatments Interact with Blocks 


The assumption of no treatment x block interaction implies that the treatment 
and block effects are additive. The differences among the treatments are assumed to 
be relatively constant from block to block as a consequence of additive block and 
treatment effects even though the use of blocking may result in large differences in 
responses between units from different blocks. The nonadditivity test, introduced in 
- Chapter 6 (Tukey, 1949b), can be conducted to detect nonadditivity of the multipli- 
cative form A7;p;. If the environmental conditions are sufficiently different in one 
or more of the blocks the relative performance of the treatments may be affected. 
For example, if the residual nutrient base in the soil is quite different from block to 
block in a field crops fertility trial there can be little or no response to fertilizers in 
some blocks, whereas the responses can be quite substantial in other blocks. 


Multiple Experimental Units per Treatment in Each Block 


A more general nonadditivity is represented by the general interaction term 
(Tp)i;. To test the existence of interaction the experiment must have more than one 
experimental unit for each treatment within each of the blocks. The linear model for 
an experiment with u experimental units for each treatment in each of r blocks 1s 


Yijh = ATE + pj + (Tp)ig + eije (8.8) 
AEE HOeN Se Bre eS ae 


where e;,;, are the random, independent experimental errors with means 0 and 
variance o”. The variance o? is the variability among experimental units within a 
block that have received the same treatment. The computations for sum of squares 
partitions and the test for interaction are the same as for a two-factor factorial 
arrangement shown in Chapter 6. 


272 CHAPTER8 COMPLETE BLOCK DESIGNS 


Subsamples of Experimental Units 


There are occasions wherein two or more samples from the experimental units 
are required for data collection. Situations requiring subsampling of the units were 
discussed in Chapter 5. The linear model for an experiment in a randomized com- 
plete block design with n subsamples of the experimental units is 


Yijk = WT + pj + Cig + dizk (8.9) 
pale ee 


where the d;,, are the random effects for subsamples with mean 0 and variance a4: 
The other terms are as described in Equation (8.1) for the randomized complete 
block design. The adjustments to the analysis are those shown in Chapter 5. 

Gates (1995) provided a detailed discussion on experimental error estimation 
in block designs with different configurations of experimental units and sampling 
units. 


Analysis of Residuals to Evaluate Assumptions 

The assumptions regarding the experimental errors of the linear model for the 
randomized complete block design can be evaluated with an analysis of the 
residuals (discussed in Chapter 4). The residuals are computed from the 
experimental error deviation component shown in Equation (8.2) as €;; = 
Yij — Ui. — 9; +9... For example, the residual for the control treatment in block | 
of Example 8.1 is 


en = Wil — oy a 
= 34.98 — 38.28 — 38.00 + 42.07 = 0.77 


The plot of residuals versus the estimated values and the normal probability plot of 
the residuals are shown in Figures 8.2a and 8.2b. The assumptions of homogeneous 
variance (Figure 8.2a) and normal distribution (Figure 8.2b) appear to hold for 
these data. 


Did Blocking Increase Precision? 


The expectation. of increased precision in the estimates of treatment means moti- 
vates us to use the randomized complete block design. Planning and conducting an 
experiment with the randomized complete block design requires extra effort rela- 
tive to the completely randomized design. The relative efficiency measure 
(discussed in Chapter 1) evaluates the benefits of blocking for a particular 
experiment. 

The efficiency of a randomized complete block design is compared with that of 


a completely randomized design. The estimate of a7, say s?,,, is the mean square 
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Figure 8.2 Residual plots from the analysis of variance for data on nitrate nitro- 
gen in Example 8.1: (a) square root of the absolute residuals vs estimated values 
and (b) normal probability plot of residuals 


for experimental error from the analysis of variance for the current experiment in a 
randomized complete block design. An estimate of o*? from the completely 
randomized design 0 used, say 82., i is required for the relative efficiency measure. 
The computation of s?. from the information in the randomized complete block 
analysis of variance is (Cochran & Cox, 1957; Kempthorne, 1952) : 
SS Blocks + r(#t—- 1)MSE 


2 
es 8.10 
— rt— 1 see 


The estimate of o?. for the wheat fertilization study is 


197.0 -+- 2007.2) 
2 
= ——_—_ = 14.8 
Sor ee: 
and the estimate of o?,, is s?,, = MSE = 7.2. 
The relative efficiency estimate, without degrees of freedom correction for 
estimates of 07, is 


- 14.8 
S 
RE = —— = =~ = 2.06 8.11 
i oS 


The correction for estimation of 7? by s? is 


(freb + fer +3) _ 16-21 
(frep + 3) fer +1) 18-19 


= 0.98 
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where f,-, = 15 and f., = 18 are the error degrees of freedom for the oe 
complete | block and completely randomized design, respectively. The correction 
duces the RE to 0.98(2.06) = 2.02. The correction has little effect mihi Saeco 
sized degrees of freedom for experimental error variance estimates. 

The randomized complete block design for the wheat experiment is estimated 
to be slightly m an twice as efficient as a setae randomized design. The 


iy the compiely nleiied Hod inst to are equivalent variances for the treatment 
means under the same experimental conditions in that field. Blocking on the irriga- 
tion gradient was effective as a measure to control and reduce the experimental er- 
ror variance estimate in this instance. Future experiments of this same nature would 
likely benefit from the blocking practice. 


A Quick Check for Effective Blocking 


Lentner, Arnold, and Hinkelmann (1989) discuss a relationship between rela- 
tive efficiency and the ratio H = MSB/MSE. They point out that, although the 
ratio H is equivalent to an Fo statistic, a valid test for block effects does not exist. 
From Equations (8.10) and (8.11), the relative efficiency measure can be expressed 
as 


ig 8? oi) MSB + r= DMSE 
SCS (rt -—-1) MSE 


rcb 


(8.12) 


With some rewriting, the expression becomes 
RE=k+(1-bH (8.13) 


where H = MSB/MSE and k= r(t — 1)/(rt — 1). The following relationships 
for RE and H can be determined from Equation (8.13): 


Bi A if and only if 2 
hi =4 if and only if <a 
RE> 1 ifandonlyif H>1 


H can be used to evaluate the effectiveness of blocking even though it is not a 
valid F’ statistic for testing block effects. For example, if RE > 1, eas a SP 
blocking has been effective in reducing experimental error. Fewer replications a 
required for the randomized complete block design relative to the alec ran- 
domized design. The value of H does not provide the complete information about 
relative efficiency but only the implication of greater, lesser, or equal efficiency. H 
is a quick check for whether blocking was effective. 
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Random Blocks 


The blocks of units often constitute a random sample of blocks available to the 
investigator. Sites used as blocks in ecological, forestry, or wildlife studies can be 
random samples of many sites available for the study. Plots may be established in 
each of the sites for the treatments. Manufactured batches of material used as 
blocks for experimental treatments are random batches. The batch of material (such 
as fabric, asphalt, or chemical product) is divided into smaller experimental unit 
batches to which the experimental treatments are administered. Schools used as 
blocking criteria in education studies can be random representatives of available 
schools in the school district. Classrooms within the schools serve as the experi- 
mental units for treatments. 

The base of inference for treatments in a study with random blocks extends to 
a population of blocks from which the blocks in the study are a random sample. As 
a consequence of random blocks, the standard errors for treatment means will be 
different from those for an experiment with fixed blocks. The linear model with 
random block effects is 


Yip pe TR +b; + ey (8.14) 
ere ae Se PP eee 


where p is the general mean, 7; is the fixed treatment effect, b; is the random block 
effect with mean 0 and variance o#, and e;,; is the random experimental error with 
mean 0 and variance o*. Under the mixed model an observation has expected value 
E(yij) = 4+7; and variance o? + of. There is also a covariance of 0} between 
any two observations in the same block with random blocks. 

The variance of a treatment mean with random blocks in the randomized com- 
plete block design is 


l 
oz = —(0° +95) (8.15) 
cs if 


The variance of a treatment mean with random blocks includes the component of 
variance for blocks, Gi, and will be larger than the variance with fixed block ef- 
fects. The variance of the difference between two treatment means is not affected 
by the random block effects and will be the same as that shown for fixed effects in 
Equation (8.6). 


8.3 Latin Square Designs Use Two Blocking Criteria 


Two Blocking Factors May Be Necessary 


Recognition of a factor, other than planned treatments, that influences the response 
variable was important in the experiment on wheat fertilization in Example 8.1. 
Blocking the experimental plots on the basis of the irrigation gradient doubled the 
efficiency of the experiment. 
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In some experimental settings two factors, other than treatments, may influence 
the response variable. Even more precision can be achieved if we can block the 
units on the basis of the two factors. If a second factor is a candidate for a blocking 
criterion, the Latin square arrangement can be used for the experiment design. 

The Latin square arrangement derives from an arrangement of the Latin letters 
A, B, C, ... into a square array such that each letter appears once in each column 
and once in each row of the square. In applications to experiments, the rows and 
columns of the array are identified with the two blocking criteria and the Latin let- 
ters are identified with the treatments. One such application occurred in the follow- 
ing example. 


Example 8.2 Relationship Between Wheat Yield and Seeding Rate 


Wheat cultivation practices such as seeding rate, row spacing, and date of 
planting have direct effects on the crop yield. Cultivation practices to opti- 
mize production are established with experiments on newly introduced wheat 
cultivars. 


Research Objective: In one such instance, a researcher wanted to determine 
the optimum seeding rate for a newly introduced durum wheat with a high 
semolina extract important to the making of pasta. 


Treatment Design: Five seeding 30, 130, 180, and 230 Ib/ac 
were used for the treatment aides ae Based « on other c cu ava common 0 jhe 
area, these seeding rates should have bracketed the rate for optimum 
production. 


Experiment Design: The experiment was conducted in an irrigated field with 
the water gradient along one direction of the experimental area. In addition, 
the experimental fields on this farm were known to have soil gradients 
created b required to make the land suitable for EnEsnels 


; The seeding rate treatments were randomized to the field plots it in a 
5 x 5 Latin square arrangement. 

The layout of the experimental plots in a Latin square design after ran- 
domization is shown in Display 8.3. The grain yield for each plot in hun- 
dredweight (100 Ib) per acre is shown in each plot along with a treatment 
letter. 

The field row blocks coincided with the irrigation gradient, and the col- 
umn row blocks — — si ss echo a i cna to the irriga- 
tion gradient. The tr ents are arraved in a Latin sat wie 
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Display 8.3 Arrangement of Experimental Plots for the Wheat 
Experiment in a Latin Square Design 


Field Row Column 1 Column 2 Column 3 Column 4 Column 5 __ Irrigation 


{ 


(R] BAO] SFO] FRAT] SE] 


Soil Gradient ———> 


Source: Dr. M. Ottman, Department of Plant Sciences, University of Arizona. 


Other Applications of Latin Squares 


An experiment to test petennier tire ens is a a-classic paelatd used to 
fen ieatat se ee > experiment tests four 


Ses ed 


ane tire 5 aenIeand of sich anton bil, The row and column blocking criteria are 
the automobiles and tire positions, respectively, for the design in Display 8.4. 


Display 8.4 Latin Square Arrangement for Automobile 
Tire Treatments 


Tire Position 


Zach treatment (A, B, C, or D) appears once in each row (auto) and once in 
each coum nn Poet postrign) The rationale for the blocking criteria is that the wear 
on the tires may differ among the automobiles and the positions in which the tires 
are cass tl on the automobile. 


1e blocking arrangement does not have to be rectangular for a Latin square 
aavtraearent't 16" be “isch Gi error variance sPearctior A linear arrangement of 
treatments in a greenhouse experiment or a linear arrangement of treatments pro- 
cessed over time may be ordered according to the Latin square assignment. Display 
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8.5 illustrates a linear arrangement of four treatments with a Latin Square 
assignment. 


Display 8.5 Latin Square Assignment with a Linear 
Arrangement of Experimental Units 


Row i 2 3 4 
A] BL ELD) |B] Cl Dy Al [cpl als] [D[TA[B[C) 
Colum Tg : Pi P eee Suen tt 2. go 


Gradient 


in time or space 


Use Standard Latin Squares to Generate the Designs 


All Latin squares of a specified size can be generated from the standard 
squares. A standard square has the treatment symbols (A, B, C, ...) written in 
alphabetical order in the first row and in the first column of the array. Each treat- 
ment symbol occurs once in each column and once in each row of the array. Only 
one standard square exists for t = 2 or 3 treatments. There are 4 standard Squares 
with ¢ = 4 treatments and 56 standard squares with t = 5. The number of standard 
Squares increases dramatically with the number of treatments since there are 9408 
standard squares with 6 treatments. 

The standard squares for t = 2, 3, and 4 treatments and samples of squares up 
to ¢ = 10 treatments are shown in Appendix 8A. Fisher and Yates (1963) pub- 
lished the complete set of Latin squares for t = 4 through ¢ = 6 treatments along 
with samples of squares for up to t = 12 treatments. 


A standard square of any size can be generated by writing the first row of let- 
ters in alphabetical order. The second row is obtained from the first by shifting the 


first row one letter to the left, moving the letter A to the extreme right-hand posi- 
tion of row 2. The third row is obtained by shifting the second row one letter to the 
left, placing the letter B in the extreme right-hand position of row 3. This process is 
continued for the remaining rows. A standard 6 x 6 Latin Square constructed in this 
manner is 


Thy Wa Mae) SOS 
oe a Cee 
ee cy 
Cy Ce oe Be) core 
Ci We ge the te 
aa ie ee 
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ive endosiy en because it requires the number of treat- 


aa entias error variance; : aan their alue can b de limi 
unless two or more ‘eaeiionn of t a are Peis With treatment numbers 
in excess of t = 8 to 10 the required number of experimental units can become pro- 
hibitive, depending on the circumstances of the contemplated experiments. The 
suitable size for many experiments in the Latin square arrangement includes those 
with t = 5 to 7 treatments. 

Preece (1983) provides a historical background of the Latin square and a 
general discussion of variations on the Latin square design suggested for experi- 
mental work with row—column blocking design. 


How to Randomize the Design 

If all standard Latin squares of size txt are available, randomization is 
accomplished with the following steps: 

Step 1. Randomly select one of the standard squares. 

Step 2. Randomly order all but the first row. 

Step 3. Randomly order all columns. 

Step 4. Randomly assign treatments to the letters. 

All possible randomizations can be generated without including the first row in 
Step 2 if a standard square is randomly selected. If all standard squares are not 
available for selection, then it is recommended in Step 2 that all rows be included in 
the randomization. Not all possible Latin squares are generated in this way but the 


number of possibilities is increased considerably. Suppose the standard square 
selected at Step 1 for the 4 x 4 Latin square experiment with automobile tires is 


AD * D 
B 2 ae 
DS % Saeee . eee « 
Df 8 C 


Step 2. Obtain a random permutation of numbers to order the last three rows: 


Permutation Original Row 
2. z 
l 3 
a 4 


280 
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The placement of the rows for the standard square with row 1 in its original posi- 


tion is 
Original Row 
l A B 2 D 
3 o D A B 
4 D A B C 
2 B C D A 


Step 3. Obtain a random permutation of numbers to order the four columns from 
Step 2. 


Permutation Original Column 
| l 


4 2 
3 3 
2 4 
The placement of the columns for the standard square is 


Original Column 


Ge Oy eh 
pan CE OO a 
OW Pralw 
AF OWN 


Step 4. Obtain a random permutation to assign treatments to the letters. This 
assignment is not necessary if the standard square has been selected at random from 
all possible standard squares. The method of assignment is shown here for illustra- 
tion. Suppose the treatment labels are W, X, Y, and Z: 


Permutation Treatment 


4=D W 
Ze 5 x 
3=C x 
L=A é 


The treatment labels W, X, Y, Z replace the Latin square letters in the order D, B, 
C, A in the randomized arrangement. The final placement of the tire treatments on 
the automobiles and tire positions is 
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Tire Position 


Auto i 3 4 
1 Z W yd X 
Z x Xx rs W 
3 W = i X Zz 
4 Xx Z W ¥ 


Statistical Model and Analysis for Latin Square Designs 


The linear statistical model for an experiment with ¢ treatments in a t x t Latin 
Square design is 


ee j= MT pi = Vj + Th + ei; (8.16) 


ik SAD Gt 


where y;; is the observation on the experimental unit in the 7th row and jth column 
of the design. The row and column effects are p; and y;, respectively; 7; is the 
effect of the kth treatment; and the e;; are random, independent experimental errors 
with mean 0 and variance o”. It is assumed there is no interaction between the treat- 
ments and the rows and columns. 

The notation for totals and means of observations for rows and columns fol- 
lows the usual convention with y; = eS yi; for a row total and y ; = ae y;; for a 
column total. The treatment total will be represented simply as y;, implying a sum 
of the observations over the ¢t experimental units receiving treatment k. Likewise, 
y;, will represent the mean of the observations on the kth treatment. 


Two Sum of Squares Partitions for Blocking 


The sum of squares partitions can be derived from the algebraic identity 


vj —9. = Gi, -9)+G;-7)+G.-¥) 


+ Yig — Gi. — V5 — Ve + 29.) ae 
The deviation of an observation from the grand mean y;; — y_ is expressed as an 
additive sum of: 
e arow deviation Gy 7) 
-e acolumn deviation (7 og 4) 
-e atreatment deviation (q,, — 7 ) 


e experimental error (y;; — Y;, — Y. oo Zu) 
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For example, the means for the wheat experiment in Example 8.2 (shown in Table 
8.7) are used to illustrate deviations for the observation in row | and column 1 with 
treatment 5 = E, as 


e the row 1 deviation: ¥, —y = 54.15 — 54.53 = — 0.38 
e the column 1 deviation: y¥, —y = 52.43 — 54.53 = — 2.10 
e the treatment E deviation: y; —y = 58.88 — 54.53 = 4.35 


e the experimental error for y11:. 


yu — 91. — 91 — Ys + 29, = 59.45 — 54.15 — 52.43 — 58.88 + 2(54.53) = 3.05 


The sum of the four deviations is — 0.38 —2.10+ 4.35 + 3.05 = 4.92, which is 
the same as the total deviation y;; — y_ = 59.45 — 54.53 = 4.92. 

Squaring both sides of Equation (8.17) and summing the terms leads to an 
additive partition of 


SS Total = SS Rows + SS Columns + SS Treatment + S'S Error 


Table 8.6 summarizes the sums of squares in an analysis of variance with expected 
mean squares for fixed treatment effects. 


Table 8.6 Analysis of variance for experiments in a Latin square design 


Source of Degrees of Sum of Mean Expected 
Variation Freedom Squares Square Mean Square 
Total t? — 1 » U(yij -— 9.) 
tj 

Rows t—As. 4 DG; - 39.) MSR 

t 
Columns t{—A a 7) MSC 

j 
Treatments i-1 t Ch —yy MST o? + 0? 
Error (t — 1)(¢ — 2) SS Error MSE o* 


3 The sum of squares for experimental error has been reduced from that of the 
randomized complete block design by an amount equal to SS Rows or SS Col- 
umns at a cost of t — 1 degrees of freedom. The mean square for experimental error 
as an estimate of o” has very few degrees of freedom with a small number of treat- 
ments. Considerable power is lost in the tests of hypotheses for treatment com- 
parisons unless the reduction in error sum of squares due to blocking by both row 
and column criteria is substantial. 

The effectiveness of blocking by either criterion evaluated with the relative 
efficiency measure is demonstrated with the analysis of grain yield from the wheat 
seeding rate experiment in Example 8.2. 
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The observations along with row, column, and treatment means are shown in 

Table 8.7 in the Latin square arrangement. The data are the grain yield for each plot 

in hundredweight (100 Ib) per acre. The analysis of variance is shown in Table 8.8. 


Table 8.7 Grain yield of a wheat variety for five different seeding rates in a Latin 
square design [Treatment label (A, B, C, D, or E) in parentheses following yield 
value] 


Field Column Row 

Field Means 
Row ] 2 3 4 2 (Y;.) 

l 59.45(E) 47.28(A) 54.44(C) 50.14(B) 59.45(D) 54.15 

2 55.16(C) 60.89(D) 56.59(B) 60.17(E) 48.71(A) 56.30 

3 44.41(B) 53. 72(©) 55.87(D) 47.99(A) 59.45(E) 52.29 

4 42.26(A) 50.14(B) 55.87(E) 58.74(D) soe) 52.58 

5 60.89(D) 59.45(E) 49.43(A) 59.45(C) 57.3108) 57.31 
Column Yy 
Means (¥ ;) 
Treatment A B C D E 
Seed rate 30 80 130 180 230 
Mean (¥;) 47.13 $172 55.73 59.17 58.88 


Table 8.8 Analysis of variance for grain yield of a wheat variety at five seeding 
rates ina 5 x 5 Latin square design 


Source of Degreesof Sumof Mean 
Variation Freedom Squares Square F Prot 
Total 24 716.61 
Row 4 99.20 24.80 
Column 4 38.48 9.62 
Seed Rate 4 522.30 1:30:57 27.67 .000 
Error 72 56.63 i 

Standard Errors for Treatment Means 

The standard error estimate for a treatment mean 1s 

[MSE | 4.72 
sz, = = i = as 1.97 (8.18) 
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and the standard error estimate for a difference between two treatment means is 
2MSE 2(4.72) 
5 (7, eS Um) — t = 5 = P57 (8.19) 


Tests of Hypotheses About Treatment Means 


The Fo statistic to test the null hypothesis of no differences among treatment 
means, 

ee eae 
GMSE - 472 


Fy = 27.67 (8.20) 


exceeds the critical value F’o5,.4.12 = 3.26 with observed significance level Pr > F 
= .000 (Table 8.8). 


Interpretations of the Quantitative Treatment Factor with Regression 
Contrasts 


The treatment factor for this experiment is a quantitative factor with five 
levels. An analysis for the regression of grain yield on seeding rate with orthogonal 
polynomial contrasts for seeding rate will provide a good description of the effect 
of seeding rate on grain yield. The coefficients for the orthogonal polynomial con- 
trasts are found in Appendix Table XI. Orthogonal coefficients and the sums of 
squares for the linear and quadratic polynomial regression contrasts are shown in 
Table 8.9. 


Table 8.9 Linear and quadratic polynomial regression sums of squares partitions 
for seeding rate (Example 8.2) 7 


Seeding Rate 30 80 130 180 230 SS* 
Mean (¥;) 47.13 8 ae ae.13 59.17 58.88 

Linear (Pi) —2 —] 0 ] 2 = Bie 
Quadratic (Po;) 2 —1 —2 —1 2 Je. 


"SS = riivag, ols 


The respective Fo statistics to test the null hypotheses for the linear and qua- 
dratic contrasts are Fy = 478.95/4.72 = 101.47 and Fo = 38.11/4.72 = 8.07. Both 
ratios exceed the critical value F'95,1.12 = 4.75. The sum of squares for deviations 
from linear and quadratic regression is 


SS Seed(deviations) = SS Seed — SS Seed(linear) — S.S Seed(quadratic) 


= 522.30 — 478.95 — 38.11 = 5.24 
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with 2 degrees of freedom; a test of hypothesis would indicate no significant devia- 
tions from the quadratic equation. 

The computed quadratic polynomial regression equation to estimate grain yield 
(7%) from seeding rate (R), using the techniques described in Chapter 3, is 


G = 43 + 0.14R — 0.0003 R? 


A plot of the equation is shown in Figure 8.3. The maximum estimated grain yield 
occurs for a seeding rate of R = 233 pounds per acre, which is an extrapolation be- 
yond the highest rate used in the experiment. Another experiment with a seeding 


rate above 230 pounds per acre would be required to safely estimate the maximum 
seeding rate. 


Estimated grain yield 


50 100 150 200 
Seeding rate 


Figure 8.3 Estimated response between grain yield and seeding rate 


Analysis of Residuals to Evaluate Assumptions 


Residuals can be used to evaluate the assumptions of the model (as discussed 
in Chapter 4). From Equation (8.17), the residual for the observations on the kth 
treatment in the zth row and jth column is 


By Vg AH Os, FH, 


The residual plots are left as an exercise for the reader. 
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If Treatments Interact with Blocks 


Tukey (1955) devised a test for the assumption of additive treatment, row, and 
column effects. From Equation (8.17), the additivity of treatment, row, and 
column effects results in the estimated value of an observation as 

Go, 7 8 eo te 


The computation of a 1 degree of freedom sum of squares for nonadditivity 
requires the values of %;; for each experimental unit and the residual for each 
experimental unit, €;; = y;; —y;;. The sum of squares for nonadditivity is 


(8.21) 


| 3S is the sum of squares for error from the Latin square analysis of variance 
on wi The Fo statistic to test the hypothesis for nonadditivity is 


(8.22) 


where 


has v = (t — 1)(t — 2) — 1 degrees of freedom and SSE is the sum of squares for 
error from the Latin square analysis of variance for the original observations y;;. It 
is recommended that the values for%,; be coded as k(y,; — y_), where k is a con- 
stant to scale the size of the values for computational convenience. 


Did Both Blocking Factors Increase Precision? 


The efficiency of the Latin square design with two blocking criteria is determined 
relative to the randomized complete block design with only one blocking criterion. 
Relative efficiency measures can be computed separately for the row and column 
blocking criteria of the Latin square. 


Relative Efficiency of Column Blocking 


If only the row blocking criterion is used for blocking in a randomized com- 
plete block design, the estimated mean square for error is 
MS Columns + (¢—- 1I)MSE | 
Shag = a (8.23) 
where MSE is the mean square for error from the current Latin square analysis of 
variance. 
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The estimated value for Example 8.2 is s?,, = [9.62 + 4(4.72)|/5 = 5.70 and 
s7, = MSE = 4.72. The relative efficiency of column blocking for the experiment 
1S 

’ 5.70 


S 
RE-o = 22 = —— = 1.21 
s? 4.72 


ls 


There is a 21% gain in efficiency over the randomized complete block design in 
which only the row criterion of the Latin square design is used for blocking. Thus, 
the column blocks for soil gradients across the field effectively reduced the error 
variance by 21%. The randomized block design without the column blocks for soil 
gradients would require 1.21(5) = 6 replications to have an estimated variance of 
the treatment mean equal to that from the Latin square design. 


Relative Efficiency of Row Blocking 


If only the column criterion is used for blocking in a randomized complete 
block design, the estimated mean square for error is 
MS Rows + (t—- )MSE 
2, - eo (8.24) 
For Example 8.2, s?,, = [24.80 + 4(4.72)|/5 = 8.74 and RE,ow = 8.74/4.72 = 
1.85. There is an 85% gain in efficiency with row blocking for the irrigation gradi- 
ent in the experiment. Without row blocking for irrigation gradient the experiment 
would require 1.85(5) = 9.25 or 10 replications of each treatment in the ran- 
domized complete block design to have an estimated variance of the treatment 
equal to that from the current Latin square design. 


Correction for Estimating a7 
The correction for estimating 0” by s? is 


(fis a8 DS rct + 3) a 9 — 0.97 


Ch £30k Ue 2 13-12 


where f;, = 12 and f,., = 16 are the error degrees of freedom for the Latin square 
and randomized complete block design, respectively. The correction reduces the RE 
from 1.21 to 0.97(1.21)= 1.17 for column blocking and from 1.85 to 
0.97(1.85) = 1.79 for row blocking. The correction has a small effect on the 
efficiency estimates. 


Multiple Latin Squares and Latin Rectangles 


The Latin square design with two blocking criteria and four or less treatments is 
very restrictive and provides too few degrees of freedom for an effective estimate 
of experimental error variance. It is common to repeat the experiment with more 
than one square under these circumstances. 
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There are two distinct forms of the design with multiple squares. The first form 
has distinct row and column identification for each square in the experiment. The 
second form has either the row or the column identification common to all squares. 
An example of the first form occurs in agricultural field trials in which two Latin 
squares are utilized in separated areas of the research farm. The second form is 
exemplified with the automobile tire experiment in which two groups of four auto- 
mobiles are used for two Latin square arrangements, with the eight automobiles 
representing column blocking and the four tire positions common to both squares 
representing row blocking. This latter form of the design is known as the Latin 
rectangle. The two forms are illustrated in Figure 8.4. 

Figure 8.4a is representative of the agricultural field experiment with multiple 
unique Latin squares and Figure 8.4b is representative of the automobile tire experiment 
with positions as rows and automobiles as columns in a Latin rectangle. 


Column 
Row «4d 2 E 4 ] 6 fg 8 
| A B 2 D 
2 B S D A 
3 .. D A B 
(a) 4 D A B 2s 
5 A B . D 
6 B A D C 
7 S D B A 
8 D c A B 
Column 
Row 1 ae 4 3 6 Z 8 
1 A B G D A B c D 
(b) 2 B ‘s D A B A D & 
3 3 D A B C D B A 
4 D A B Ss D Cc A B 


Figure 8.4 Multiple Latin squares with (a) rows and columns 
unique for each square and (b) the Latin rectangle with rows 
common to both squares and columns unique to each square 


Randomization is performed separately for each square when row and column 
blocking is unique for each of the s squares. With the Latin rectangle it is also pos- 
sible to consider each square unique and randomize accordingly. When the row cri- 
terion is consistent across columns it is only necessary to require that each 
treatment occur s times in each row and one time in each column. The randomiza- 
tion then consists of one randomization of the t rows and a separate randomization 
of the entire set of st columns. 

The linear model for the Latin rectangle with unique row and column blocking 
criteria in each Latin square is 
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Yigl = KI Pal) + Y5(2) + Tk + eagl = 25) 
fe 


where «, is the square effect, and pj,j) and 7,j) are the row and column effects 
nested in the squares, respectively. It may be necessary to consider a 
square x treatment interaction component in the model if it is suspected that the 
treatment comparisons may differ from one square to another. A discussion of an 
experiment by treatment interaction is found in Section 8.6. An outline of the 
analysis of variance with unique row and column blocking is shown in Table 8.10. 


Table 8.10 Analysis of variance for an experiment repeated with s unique Latin 
square arrangements : 


Source of Degrees of Sum of 

Variation Freedom Squares 

Total st? — | % = Uys — 9.) 

a 

Squares s—1 t? ~ wi- ¥.)? 

Rows within squares s(t — 1) td Yo (yt - ¥ 7)" 
t 

Columns within squares s(t — 1) t » UG a- 9p 
j 

Treatments t—1 3 st » Y,- yy 

Error (st —s—1)t—- 1) By subtraction 


The linear model for the Latin rectangle that has common row blocking criteria 
for s complete Latin squares and unique column blocking is 


Yij = Mt Pit V5 + Th Cis (8.26) 
if eles eal het = 192, cht. 


An outline of the analysis of variance for common row and unique column block- 
ing is shown in Table 8.11. The standard error estimates for treatment means and 


differences between two treatment means, respectively, are ,/MSE/st and 


J/2MSE/st. 


8.4 Factorial Experiments in Complete Block Designs 


The treatment design used to address the research hypothesis can be placed in any 
compatible experiment design. The appropriate sum of squares partitions can be 
computed for the analysis of variance as long as the randomization restrictions are 
followed for the experiment design in question. 
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Table 8.11 Analysis of variance for an experiment with ¢t treatments in a Latin 
rectangle with ¢ rows and st columns 


Source of Degrees of Sum of 


Variation Freedom Squares 

Rows tot st (Y;,— 9.) 

Columns | st — 1 1G ;- 9) 

Treatments t—1. st Ge Gy 
_ Error (st — 2)(t — 1) By subtraction 


The two-factor factorial in a randomized complete block design has all ab 
treatment combinations an equal number of times in each block. With the Latin 
square design each of the ab treatment combinations appears one time in each row 
and one time in each column. A Latin rectangle design with a common row criter- 
ion for s squares requires each treatment combination to appear one time in each 
column and s times in each row. 

The linear model for a two-factor factorial, factor A with a levels and factor B 
with 6 levels, in a randomized complete block design with r blocks is 


Vigh = pt + pe + a: + 6; HOB) + eign (8.27) 
Pt a 


The sum of squares partitions for a two-factor factorial in a randomized 
complete block design are illustrated in Table 8.12. The treatment sum of squares 
with (t — 1) = (ab — 1) degrees of freedom is partitioned into sums of squares for 
the main effects of factors A and B and interaction effects as described in Table 
6.5. The sum of squares for blocks and experimental error are analogous to those 
shown for the randomized complete block design in Table 8.2. Standard errors are 
computed according to the conventions outlined in Chapter 6 for the factorial 
experiment in a completely randomized design. 

A similar pattern of analysis is followed for the Latin square and Latin rectan- 
gle in which the square, row, and column sums of squares are partitioned according 
to the analyses in Tables 8.6, 8.10, or 8.11 with t = ab. The sum of squares parti- 
tions for factorial main effects and interactions are computed as described in 
Chapter 6. ay 
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Table 8.12 Analysis of variance for a two-factor treatment design in a randomized com- 
plete block experiment design 


~ Source of Degrees of Sum of Mean Expected 
Variation Freedom Squares Square Mean Square 
Total rab — | SS Total 
Blocks r—1 SS Blocks 
A ol SSA MSA o* + rbé? 
B b—1 SSB MSB o* + rae? 
AB (a — 1)(6 — 1) SS(AB) MS(AB) o* + r6?, 
Error (ab — 1)(r — 1) SSE MSE o? 


*See Chapter 6 for computational formulae of SSA, SSB, and SS(AB). SSE is obtained by 
subtraction. SS Blocks = abu (¥,— 9)’. 


8.5 Missing Data in Blocked Designs 


Missing data in research studies were discussed with factorial treatment designs in 
Chapter 6. Missing observations affect the relationships between blocks and 
treatments just as they affected the relationships between factors in the factorials. 

The effects of treatments and blocks are nonorthogonal when data are missing. 
A contrast for one set of effects conveys some information about the other set of 
effects. Thus, the effects for blocks must be considered when computing the sum of 
squares partition for treatments. 

Full and reduced alternative models are used for the complete block design to 
(1) compute unbiased sum of squares partitions for treatments and experimental 
error and (2) compute unbiased least squares estimates of treatment means and their 
standard errors from the available data. The procedures for least squares solutions 
to the normal equations and the sums of squares for the full and reduced models 
were outlined in Chapter 6. Many of the present-day computer programs for the 
analysis of variance are capable of performing this analysis. 


Analysis with Missing Data in Complete Block Designs 


The analysis with missing data for the randomized complete block design will 
differ slightly from that for the factorial design described in Chapter 6 because the 
- randomized complete block model assumes no interaction effects. Thus, the analy- 
sis omits any test for interaction prior to estimating the treatment effects. 
Solutions to the normal equations for the full model, y;; = +7; + p; - ee 
are used to compute the estimates, %,; = +7; +p,;, and the experimental error 
sum of squares for the full model, 


Sele = via — te ty = My); 
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Solutions to the normal equations for the reduced model, y;; = 4+ p;+ ej; , are 
used to compute the estimates, 7,; = 2+ p,;, and the experimental error sum of 
squares for the reduced model, 


SSE, = O (yj -—-B,Y 
The treatment sum of squares is computed as 


S'S Treatment (adjusted) = SSE, — SSE; (8.28) 


and it represents the reduction in sum of squares as a result of including 7; in the 
full model. It is referred to as the adjusted treatment sum of squares, implying that 
block effects are also considered when estimating the treatment effects in the full 
model. 

The orthogonal sum of squares partitions with m missing observations are 
shown in Table 8.13. The reduced model ignores the treatment classification, and 
S'S' Blocks (unadjusted) from the reduced model is the sum of squares due to dif- 
ferences among block means ignoring treatments. 

The application of an analysis with missing observations is reserved for an 
exercise at the end of the chapter. 


Table 8.13 Analysis of variance for a randomized complete block design with m 
missing observations 


Source of Degrees of Sum of 

Variation Freedom Squares 

Total | tr —m—1 Ud (yi; — 9.) 
tj 

Blocks (unadjusted)* r—1l aly oy 
j 

Treatment (adjusted) t—1 SOL, ~ aah 

Error (r—1)\(t-—1)—m SSE; 


*S'S Blocks (unadjusted) = SS Total — SSE,; n; = number of observations in jth block 


8.6 Experiments Performed Several Times 


Experiments are repeated at several places or on several different occasions for 
various reasons. Repetition over time or space provides a form of replication to in- 
crease the precision of treatment mean estimates or increase the degrees of freedom 
for estimates of experimental error. Repeated experiments can provide an expanded 
inference base to evaluate treatments over a broader range of conditions. In other 
cases, the magnitude of treatment comparisons is expected to differ among places 
or times. The series of experiments are used to examine the variation in treatment 
differences relative to the environmental changes. 

Whatever the reason for conducting the same experiment over a series of times 
or places, a certain amount of discretion must be exercised before we automatically 
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combine the data from the series to conduct a single overall analysis. The following 
example will help clarify some caveats with a series of experiments. 


Example 8.3 Efficiency of Water Use by Bermuda Grass 


Bermuda grass is used extensively for lawns, parks, and golf courses in warm 
and dry climates. In dry climate areas maintenance of the Bermuda grass re- 
quires regular irrigation. Turf managers want species of plants that efficiently 
utilize water to reduce maintenance costs and conserve water. Considerable 
variation in water use efficiency exists among species and within species. 


Research Objective. A plant breeder wanted to determine the amount of 
variability in water use efficiency of Bermuda grass that he could attribute to 
genetic differences. Given sufficient genetic variability he could initiate a 
breeding program to develop a water-use efficient hybrid Bermuda grass. 


Treatment Design: The plant breeder made all possible (30) hybrid crosses 
between six Bermuda grass cultivars in what is known as a diallel mating de- 
sign. This particular mating design allows the plant breeder to evaluate the 
genetic potential of specific cultivars or the population represented by the 
cultivars in the designs. 


Experiment Design: He grew the progeny and the six parental cultivars in a 
randomized complete block design with two blocks at each of four separate 
locations in a field plot experiment. Two of the crosses did not produce prog- 
eny, so there were 34 plots in each block of each experiment. 

The plant breeder measured the total dry matter produced by the plants in 
each plot and the amount of water utilized on the plot to produce the plant 
material. The measurement he utilized for analysis was the ratio of water used 
to total dry matter production on each plot. The analysis of variance with 
source of variation, degrees of freedom, and mean square for each of the four 
experiments is shown in Table 8.14. 


Table 8.14 Analysis of variance for water use in each of four experiments from a 
diallel mating design with six Bermuda grass cultivars 


Source of — Degrees of Mean Squares by Location 
Variation Freedom ] 2 ) 4 
Block l 2.80 20,07 ie 1.81 
Genotypes i 1.08 17.85 1,92 1339 
Error 33 1.61 10.56 1.07 0.74 


Source: Dr. W. Kneebone, Department of Plant Sciences, University of Arizona. 


Location 2 Is Different 


The most noticeable feature of the analyses in Table 8.14 is that the mean squares 
for all sources of variation at location 2 are considerably larger than those for the 
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other three locations. In particular, the error variance at location 2, MSE = 10.56, 
is considerably larger than the error variance at the other locations. The greater 
variability at location 2 is an indication that the experimental conditions there may 
have been quite different from other locations. 


Homogeneous Variances Required to Combine Results of Several Experiments 


There were four experiments, each with two replications, for a total of eight repli- 
cations. If the four experiments were combined into one analysis the estimates of 
the genotype means would be much more precise. 

Homogeneity of error variances is required for a combined analysis of variance 
for the four experiments. A homogeneity of variance test for the error variances 
with the F’ Max test (Chapter 4) rejects the hypothesis of equal error variances for 
the four locations. One possible solution is a transformation to a logarithmic scale 
to achieve homogeneous variances prior to a combined analysis of variance. The 
analyses of variance for each location after the logarithmic transformation are 
shown in Table 8.15. 


Table 8.15 Analysis of variance for 10[logio(water use)] in each of four 
experiments from a diallel mating design with six Bermuda grass cultivars 


Source of Degrees of . Mean Squares by Location 
Variation Freedom 1 2 3 4 
Block l 1.68 3.28 1.42 1.01 
Genotypes 33 0.56 Bie | 1.09 Li 
Error a3 0.82 1.54 0.56 0.51 


The logarithmic transformation considerably reduced the disparity in the error 
variances. However, the observed mean squares for all sources of variation are still 
somewhat larger at location 2. The combined analysis still should be approached 
with caution, and an interpretation of the results initially should be made from each 
of the separate experiments. The Fo statistics at each location to test the hypothesis 
of no differences among the genotypes, Fy = MS Genotypes/MS Error, result in 
the rejection of the null hypothesis at the .05 level of significance for locations 3 
and 4. The null hypothesis is not rejected at locations 1 and 2. The same conclu- 
sions are reached from the Fp statistics with the untransformed data in Table 8.14. 


Statistical Model and. Analysis of Variance for Combined Analysis 
of Experiments 


The results of the F' tests indicate a differential performance among the genotypes 
at the four locations. The combined analysis should include the possibility of a 
genotype x location interaction. The linear statistical model for the combined 
analysis with random genotype and location effects is 
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Yijk = WT P+ 554) + gx + (gp)izk + Cijk (8.29) 


where yp is the general mean, p; is the random location effect, b,;) 1s the random 
block within location effect, g, is the random genotype effect, (gp);, is the random 
effect for genotype x location interaction, and e;,, is the random error effect. The 
combined analysis of variance for the transformed data from the four experiments 
is shown in Table 8.16 with expected mean squares for random genotype effects 
and either fixed or random location effects. | 


Table 8.16 Analysis of variance for 10[log; (water use)] for combined experi- 
ments from a diallel mating design of six Bermuda grass cultivars 


Source of Degrees of Sum of Mean Expected 
Variation Freedom Squares Square Mean Square 
Locations (L) 3 433.63 144.54 

Blocks/L 4 7.39 1.85 

Genotypes (G) 33 81.11 2.46 ae hg + tle: 
eee? 99 93.01 0.94 o7 + roe) 
Pooled error 132 113.19 0.86 a” 


The mean square for error in the combined analysis is the pooled error from 
the four experiments (average of the mean squares for error from the four experi- 
ments). The blocks are unique to each experiment and they are a nested-factor 
effect for the analysis of variance. The sum of squares for blocks within locations is 
the sum of the block sum of squares from the individual analyses of variance. 

The expected mean squares for the random or mixed model were derived using 
guidelines given in Chapter 7. If genotypes (treatments) are fixed effects, then a; is 
replaced by the equivalent component for fixed effects, ee When both locations 
and genotypes (or treatments) are fixed, then o%) is also replaced by the equivalent 
_component for fixed effects, 0, in Table 8.16. 

Whether the repetitions of experiments over time and places are random or 
fixed effects depends upon the objective of the repetition. If the repetitions are 
chosen to investigate the treatment responses to deliberate changes in environment, 
then fixed effects models for places or time, seem appropriate. If the repetitions can 
be justified as legitimate random representatives of places or time, then the random 
effects model can be used for places or time. It is perhaps most difficult to consider 
the repetitions random if only a limited number of places are available for experi- 
ments or a sequence of successive weeks, months, or years represent the time 
repetition. 


Tests of Hypotheses in Combined Analysis 


The hypothesis of no genotype x location interaction is tested with the Fo statistic 
in Table 8.16 as Fo = MS(G x L)/MSE = 0.94/0.86 = 1.09, and critical value 
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F’'o5,99,132 = 1.36. The null hypothesis of no genotype x location is not rejected. 
The hypothesis of no genotype effects is tested with the Fo statistic Fy = 
MSG/MSE = 2.46/0.94 = 2.62 with critical value F'o5,33.99 = 1.55. The null 
hypothesis of no genotype effects is rejected. 


Dissecting Treatment < Experiment Interaction 


The nonsignificance of genotype x location interaction should not be taken lightly 
since an analysis of the separate experiments revealed significant genotype differ- 
ences at the .05 level of significance at only two of the locations. It is entirely pos- 
sible that certain treatment comparisons interact with the environment while other 
comparisons are relatively constant across environments. Three sets of comparisons 
important in the Bermuda grass diallel mating design are 


e comparisons among the six parent cultivars 
e comparisons among the 28 crosses 


e acontrast between the mean of the six parent cultivars and that of the 28 
"crosses 


A separate analysis of variance for the parent cultivars provides the sums of 
squares for the first set of comparisons. A separate analysis of variance for the 
crosses provides the sums of squares for the second set of comparisons. The linear 
model in Equation (8.29) is used for the analysis. It is the same model used when 
all genotypes were included in the analysis. Only those sums of squares required 
for the comparisons of interest are shown in Table 8.17. The sums of squares for 
locations and blocks within location are not shown. 


Table 8.17 Separate analyses of variance for parents and crosses from a diallel 
mating design of six Bermuda grass cultivars 


(1) Parents Analysis 


Source of Degrees of Sum of Mean 
Variation Freedom Squares Square 
Parents 2 3.45 0.69 
Parents x locations 15 4.87 0.32 
Error (P) 20 10.04 0.50 
(2) Crosses Analysis 
Source of Degrees of Sum of Mean 
Variation Freedom Squares Square 
Crosses af 46.06 171 
Crosses x locations 8] 79.56 0.98 


Error (C) 108 103 22 0.94 
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The sum of squares for the contrast between the means of the parents and 
crosses and the sum of squares for the interaction between the contrast and 
locations can be computed by subtraction. Utilizing the sums of squares from 
Tables 8.16 and 8.17, they are 


(1) Parents versus Crosses 


SS(P versus C) = SSG —SSP — SSC = 81.11 — 3.45 — 46.06 
= 31.60 


(2) (Parents versus Crosses) x Locations 


SS[(P versus C) x L] = SS(G x L) — SS(P x L) — SS(C x L) 
= 93.01 — 4.87 — 79.56 
= $50 


The separate analyses of variance for parent cultivars and the crosses also pro- 
vide separate sums of squares for experimental error that can be identified with 
each set of comparisons. A partition of the error sum of squares can be useful be- 
cause the experimental errors can differ considerably among the several compari- 
sons. The experimental error mean squares for parents and crosses, Error (P) and 
Error (C), from the separate analyses are found in Table 8.17. 

The sum of squares for experimental error associated with the contrast between 
parents and crosses is found from the error sums of squares in Tables 8.16 and 8.17 
as 


SS Error(P versus C) = S'S Error — SS Error(P) — SS' Error(C) 
= 113.19 — 10.04 — 101.22 
= bO3 


A summary of the analysis of variance with all of the sum of squares partitions 
is shown in Table 8.18. The three mean squares for the partition of experimental 
error shown at the bottom of Table 8.18 have similar values. The pooled error with 
132 degrees of freedom probably can be used with some confidence that the error 
variances of the three groups of comparisons are relatively homogeneous. 


To Pool or Not to Pool Variances 


The decision to use the partitioned error term or the pooled error can have an effect 
on the tests of hypotheses. Tests with the pooled error have larger degrees of 
freedom and, therefore, greater ability (power) to detect differences than the 
partitioned mean square error. This effect can be seen in Table 8.18 with a test of 
the (P vs. C) x L interaction. The test statistic with the pooled error is fo = 

2.86/0.86 = 3.33 with critical value F'o5,3,132 = 2.67. The test statistic with the par- 
titioned mean square error is Fo = 2.86/0.48 =5.96 with critical value 
Fo5,3,4 = 6.59. In the latter test with fewer degrees of freedom for mean square 
error the null hypothesis is not rejected. The hypothesis is rejected with the pooled 
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Table 8.18 Analysis of variance for 10[logio(water use)| for combined experi- 
ments from a diallel mating design of six Bermuda grass cultivars : 


Source of Degrees of Sum of Mean 
Variation Freedom Squares Square 
Locations (L) 3 433.63 144.54 © 
Blocks/L 4 139 1.85 
Genotypes (G) 33 81.11 2.46 

Parents (P) 5 3.45 0.69 

Crosses (C) 27 46.06 71 

PvsC l 31.60 31.60 
Gel 99 93.01 0.94 

fee 15 4.87 0.32 

Ct 81 79.56 0.98 

CPS CX EL | 3 8.58 2.86 
Pooled Error 132 Pigst9 0.86 

Error (P) 20 10.04 0.50 

Error (C) 108 101.22 0.94 

Error (P vs. C) 4 1.93 0.48 


mean square error. It should be recognized that the null hypothesis would not be 
rejected for either of the tests at the 0.01 significance level. The difference between 
the two tests is not dramatic in this particular case, but it is necessary to be aware of 
the two possibilities for the test. 

In this particular example the results of other F’ tests are not affected by which 
error term is used, partitioned or pooled. Neither of the other G x L interaction par- 
titions is significant. Although the overall genotype x location interaction was not 
significant, there appears to be some evidence that a component of the interaction 
was significant. This result may explain the differences in significance levels of the 
tests for genotypes from the separate location analyses in Table 8.15. 


More Details to be Found 


McIntosh (1983) provided analysis of variance tables with sources of varia- 
tion, degrees of freedom, and appropriate Fo statistics to test hypotheses for an ex- 
tensive array of experiments with fixed, random, and mixed models combined over 
time, places, or a combination of time and places. Carmer, Nyquist, and Walker 
(1989) provided formulae for the estimation of variances for pairwise mean differ- 
ences from combined experiments with two- and three-factor treatment designs. The 
experiments had randomized complete block designs with fixed treatment effects and 
random time and place effects. 
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EXERCISES FOR CHAPTER 8 


1. An irrigation experiment was conducted in a randomized complete block design in a Valencia 
orange grove. Six irrigation treatments were used in eight blocks of trees. The data that follow are 
the weight in pounds of harvested fruit from each plot. 


e. 


ft: 


Block 

Method ‘4 2 a 4 7 6 7 S 
Trickle Aa Ae 2 28. et Zl 
Basin 200° Sie = gel Se ek. 29s. 20s Fe0 
Spray 331 402 183 20 2s , ee 2 46 
Sprinkler at? Ala. 3) O42 09s. So 
Sprinkler + 

Spray ayy 341 - 404. 145-182 349. 276 182 
Flood 245 3580 263 e770 Zez Tit 98 


Source: Dr. R. Roth and Dr. B. Gardner, Department of Soil and Water Science, 
University of Arizona. 


Write a linear model for the experiment, explain the terms, and compute the analysis of 
variance. 

What are the assumptions necessary for the analysis of variance to be valid? How do they 
relate to the experiment? 

Compute the standard error estimate for an irrigation treatment mean and the difference 
between two irrigation treatment means. 

Consider the flood irrigation method to be the standard practice. Use the Dunnett method to 
test the difference between the flood irrigation and each of the other methods. 

Compute the relative efficiency of this design relative to a completely randomized design. 
What are your conclusions? 

Obtain the residual plots from the analysis, and interpret them. 


2. A fertilizer trial on a range grass, blue grama, was conducted in a randomized complete block de- 
sign by a management scientist. Five fertilizer treatments were randomly assigned to plots in each 
of five blocks. The data are 100 x (percent phosphorus) in a plant tissue sample from each plot. 


Block 
Treatment q Z 3 4 | 5 
No fertilizer 7.6) 331 Pee GAL GA 


50 lb. nitrogen TOS Ye! SPST A AZZ 
100 Ie. nitrosen «6-669 «=«.60 == 56 = «674° ~610 
50 lb nitrogen 

+ 75 Ib P05 fe 112: 90 2S tS 
100 lb nitrogen 

+ 75 Ib P2O5 G6 ° 92 120° 106. 104A 


Source: Dr. P. Ogden, Range Management, University of Arizona. 
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a. Write a linear model for this experiment, explain the terms, and compute the analysis of 
variance. | 

b. Compute the 1 degree of freedom sum of squares for each contrast that follows, and test the 
null hypothesis for each. The four orthogonal contrasts among the five treatments are (1) no 
fertilizer versus the four fertilizer treatments, (2) the main effect of nitrogen, (3) the main 
effect of P2Os, and (4) the interaction between nitrogen and P2Os. 

c. Compute the standard error for each of the contrasts in part (b). 

. Compute the relative efficiency of the randomized complete block design. 
e. Obtain the residual plots from the analysis, and interpret them. 


3. The self-inductance of coils with iron-oxide cores was measured under different temperature 
conditions of the measuring bridge. The coil temperature was held constant. Five coils were used 
for the experiment. The self-inductance of each coil was measured for each of four temperatures 
(22°, 23°, 24°, and 25°) for the measuring bridge. The temperatures were utilized in a random 
order for each coil. The data are percentage deviations from a standard. 


Coil 
Temperature ] Ps 3 - 2 
22 1.400 0.264 0.478 1.010 0.629 
22 1.400". °0.235 “Uae 0.990 " 0620 
24 L373. 0.212 — Osa 0,968 0495 
25 1.370 0.208 0.440 0.967 0.495 
Source: H. Hamaker (1955), Experimental design in industry. Biometrics 


11, 25 7—236. 


a. Write a linear model for this experiment, explain the terms, and compute the analysis of 
variance. 

b. What are the assumptions necessary for the analysis of variance to be valid? How do they 
relate to the experiment? 

c. Compute the orthogonal polynomial regression contrasts for temperature and their sums of 
squares. Determine the best-fitting equation for the data. 
Compute the relative efficiency of using coils as blocks. 

e. Obtain the residual plots from the analysis, and interpret them. 


4. A traffic engineer conducted a study to compare the total unused red light time for five different 
traffic light signal sequences. The experiment was conducted with a Latin square design in which 
the two blocking factors were (1) five randomly selected intersections and (2) five time periods. In 
the data table the five signal sequence treatments are shown in parentheses as A, B, C, D, E, and 
the numerical values are the unused red light times in minutes. 


c. 


Intersection 


i 
15.2(A) 
16.5(B) 
12.1(C) 
10.7(D) 
14.6(E) 


2 
33.8(B) 
26.5(C) 
31.4(D) 
34.2(E) 
31.7(A) 


Time Period 


3 
13.5(C) 
19.2(D) 
17.0(E) 
19.5(A) 
16.7(B) 


4 
27.4(D) 
25.8(E) 
31.5(A) 
27.2(B) 
26.3(C) 


5 
29.1(E) 
22.7(A) 
30.2(B) 
21.6(C) 
23.8(D) 
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Source: Mason, Gunst, and Hess (1989), 393. 


Write a linear model for this experiment, explain the terms, and compute the analysis of 
variance. . 

Compute the standard error for a signal sequence treatment mean and for the difference 
between two signal sequence treatment means. 

Use the Multiple Comparisons with the Best procedure to select the set of signal sequences 
with the shortest unused red light time. 

What is the relative efficiency of blocking by time periods? 

Obtain the residual plots from the analysis, and interpret them. 


A research engineer studied the time efficiency of four construction methods (A, B, C, D) for an 
electronic component. Four technicians were selected for the study. The construction process pro- 
duces fatigue such that the required construction time by the technicians increases as they change 
from one method to another regardless of the order of construction methods. The engineer used a 
Latin square design with columns as “technician” and rows as “time period.” The construction 
methods were randomized to the technicians and time periods according to the Latin square ar- 
rangement. The values are construction times in minutes required for the component with the con- 
struction method indicated in parentheses. 


Technician 
Time Period 1 Z 3 4 
| 90°C) 96(D) 84(A) — 88(B) 
2 90(B) 91(C) 96(D) 88(A) 
3 89(A) 97(B) 98(C) 98(D) 
4 104(D) 100(A) 104(B) 106(C) 


Write a linear model for this experiment, explain the terms, and compute the analysis of 
variance. 

Compute the standard error for a construction method mean and the difference between two 
construction method means. 

Use the Tukey method to make all pairwise comparisons between the means of the 
construction method times. 

Use the relative efficiency measure to determine whether the time period was a critical 
blocking factor to reduce experimental error variance. 


The experiment in Example 8.2 on the relationship between wheat yield and seeding rate was con- 
ducted in a 5 x 5 Latin square design. A second replication of the experiment was conducted in an 
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adjacent field of the experimental farm. Consequently, there are two repetitions of the experiment 
with unique row and column blockings for each of the two experiments. The data for the second 
experiment follow. The seeding rates were 30, 80, 130, 180, and 230 for A through E, respectively. 
The grain yield for each plot is expressed in hundredweight (100 pounds) per acre. 


ee 


h. 


Field Field Column 

Row 7 2 5 4 2 
| 26.88(A) 38.40(D) 35.33(E) 34.56(C) 24.57(B) 
Z 37.63(E) 24.57(A) 36.09(C) 23.81(B) 32.25(D) 
3 29.95(C) 29.18(B) 33.02(D) 22.27(A) 33.02(E) 
4 32.25(D) 31.49(C) 21.50(B) 33.02(E) 18.43(A) 
5 26.11(B) 36.09(E) 23.81(A) 29.95(D) 29.95(C) 


Source: Dr. M. Ottman, Department of Plant Sciences, University of Arizona. 


Write the linear model: for the second experiment, explain the terms, and compute the analysis 
of variance. 

Compute the standard error estimates for a seeding rate mean and the difference between two seeding 
rate means. 

Compute the relative efficiency of row blocking for this experiment. Would more replications 
be required for a randomized complete block design using only the columns as blocks? If so, 
how many more would you recommend? 

Compute the linear and quadratic orthogonal polynomial regression sum of squares partitions 
for seeding rate, and test their null hypotheses. Are the deviations from a linear or a quadratic 
relationship significant? 

Do you think it is reasonable to perform an analysis of variance for the two experiments com- 
bined? Explain. 

Compute the analysis of variance for the two experiments combined. 

Compute the standard error estimates for a seeding rate mean and the difference between two 
seeding rates from the combined experiments. 

Is there an interaction between seeding rate linear or quadratic contrasts and experiments? 


7. A horticulturalist conducted a nitrogen fertility experiment for lettuce in a randomized complete 
block design. Five rates of ammonium nitrate treatments (0, 50, 100, 150, and 250 Ib/acre) were 
randomly assigned to each of two plots in each of two blocks for a total of four plots for each level 
of nitrogen. Each block consisted of ten plots, two plots for each treatment in each block. The data 
are the number of lettuce heads from each plot. 


Nitrogen Block 1 Block 2 
0 104 114 #1109 124 
50 134° 1s oS oO 164 
100 146-1 ES 
150 iT en eo GS 
200 tos te ee 


Source: Dr. W.D. Pew, Department of Plant Sciences, 
University of Arizona. 
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a. Write the linear model for the experiment, explain the terms, and compute the analysis of vari- 
ance. Note that there are multiple plots for each treatment in each block. How does this affect 
your estimates of experimental error from the analysis of variance? 

Test the assumption of no block x treatment interaction. 

c. Compute the linear and quadratic polynomial regression contrasts sum of squares partitions for 
nitrogen, and test the null hypotheses. Interpret the results. 

d. Are cubic deviations significant? 


A field plot experiment was conducted to evaluate the interaction between timing of nitrogen appli- 
cation to the soil (early, optimum, late) and two levels of a nitrification inhibitor (none, .5 Ib/acre). 
The inhibitor delays conversion of ammonium forms of nitrogen into a more mobile nitrate form to 
reduce leaching losses of fertilizer-derived nitrates. The nitrogen was supplied as pulse-labeled 1°N 
through a drip irrigation system at an early, an optimum, and a late date of application. The data are 
percent of ‘°N taken up by sweet corn plants grown on the plots. 


Nitrogen Inhibitor 
None .5 lb/acre 
Block Early Optimum Late Early Optimum Late 
| 21.4 50.8 53.2 54.8 56.9 = 
Z 1L3 42.7 44.8 47.9 46.8 54.0 
3 34.9 61.8 57.8 40.1 57.9 62.0 


Source: Dr. T. Doerge, Department of Soil and Water Science, University 
of Arizona. 


a. Write the linear model for the experiment, explain the terms, and compute the analysis of 
variance. 

b. Compute the standard error estimates for the marginal means of nitrogen inhibitor and timing 
of nitrogen application and the cell means. 

c. Test the null hypotheses of no interaction effects and no main effects for the two factors. 
Compute the relative efficiency of the randomized complete block design. 

e. Obtain the residual plots from the analysis, and interpret them. 


Use the data from Exercise 8.1 for the Valencia orange grove irrigation experiment. Assume the 

plots for trickle irrigation in block 1 (450) and flood irrigation in block 5 (336) were lost from the 

experiment. 

a. Use an appropriate computer program to compute the orthogonal analysis of variance accord- 
ing to the following partitions: 


Source of Variation: SS Blocks (unadjusted) = 432,384 
SS Irrigation Treatments (adjusted for blocks) = 51,923 
SS Error = 130,402 


Test the hypothesis of no treatment differences. : 
c. Show the least squares estimates of the treatment means and their standard error estimates if 
the computer program is capable of producing the estimates. For treatment means with no 


304 CHAPTER8 COMPLETE BLOCK DESIGNS 


10. 


11. 


missing observations, the standard errors should be the same as those computed in the usual 
manner. 

d. How have the standard errors for the trickle and flood irrigation treatment means been affected by the 
lost plots? 

e. . If the computer program is capable, compute the standard error of the difference between | 
(1) the trickle and flood irrigation means 
(11) the trickle and basin irrigation means 
(111) the flood and sprinkler irrigation means 
(iv) the basin and sprinkler irrigation means 

f. How were the standard errors in part (e) affected by the lost plots? What effect do the missing 
plots have on associated tests of differences between pairs of treatment means? 


An animal scientist conducted a beef animal-feeding trial with four treatments composed of differ- 
ent qualities of drinking water for the animals in a completely randomized design with two replica- 
tions. The experiment was conducted in the spring months and the winter months on two 
successive years. Each of the four feeding trials lasted 112 days. The data that follow are the aver- 
age daily gains for each pen of animals in each of the trials. 


Year | Year 2 
Treatment Spring Winter Spring Winter 
] 1.81 2.14 206  2,1/ 
88-22 1.9] Paes i, 
2 ie ween es: tar. 200 
1.66 202 be oe 
3 £35 ag 1.51 PRP a, 
1.59 1.93 1.49 1.94 
4 £33 1.85 1.31 1.83 
1.56 1.95 beg 215 
Source: Dr. D. Ray, Department of Animal Sciences, University 


of Arizona. 


a. Compute the analysis of variance for each of the four trials as a completely randomized design. 
Determine whether the experimental error variances are homogeneous among the experiments. 

c. Compute the combined analysis of variance for the four trials with year, season, treatment, and 
all interaction effects in the model. The experimental error is the pooled experimental error 
from the four analyses of variance from the separate trials. 

d. Assume years are random while seasons and treatments are fixed effects. Test the null 
hypotheses for seasons, treatments, and interaction effects. What are your conclusions? 


Use the data from Exercise 8.4, the study on traffic signals. Assume the observations at intersection 

| during time period 2 (33.8) and at intersection 4 during time period 5 (21.6) are missing. 

a. Use an appropriate computer program to compute the orthogonal analysis of variance 
according to the following partitions: 


12. 


13. 


14. 


45: 


16. 
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Source of Variation 

Intersections (unadjusted) 

Time periods (adjusted for intersections) 

Signal sequence (adjusted for intersections and periods) 
Error 


b. Test the hypothesis of no signal sequence differences. 
c. Show the least squares estimates of the signal sequence means and their standard error esti- 
mates if the computer program is capable of producing the estimates. 


~d. How do the lost data affect the standard errors for the sequence B and C treatment means? 


e. Ifthe computer program is capable, compute the standard error of the difference between 
(1) the sequence B and C means 
(ii) the sequence B and A means 
(111) the sequence C and D means 
(iv) the sequence A and D means 
f. How were the standard errors in part (e) affected by the lost data? What effect do the missing 
data have on associated tests of differences between pairs of treatment means? 


An experiment is to be conducted in a randomized complete block design with t = 6 treatments in 

r = 4 blocks. | 

a. Randomize the six treatments to the experimental units in a randomized complete block 
design. Show details of your randomization procedure. 

b. How many different arrangements of the treatments are possible in each of the blocks? 

c. How many different arrangements are possible for the entire experiment? 


An experiment is to be conducted in a Latin square arrangement with t = 5 treatments. Choose one 
of the standard Latin squares from Appendix 8A, and randomize the five treatments to the experi- 
mental units in a Latin square arrangement. Show details of your randomization procedure. 


Construct a standard square for a 7 x 7 Latin square arrangement. 

a. Randomize the seven treatments to the experimental units in a Latin square arrangement. Show 
details of your design construction and randomization. 

How many ways can the Latin square columns be arranged on the column blocks? 

How many ways can the Latin square rows be arranged on the row blocks? 

How many ways can the Latin square letters be arranged on the treatment labels? 

How many arrangements of columns, rows, and treatments are possible for the entire 
experiment? 


7 ee Se 


An experiment is to be conducted in a 4 x 8 Latin rectangle arrangement with four treatments. 
Choose two standard Latin squares from Appendix 8A, and randomize the four treatments to the 
experimental units in the Latin rectangle arrangement. Show details of your randomization 
procedure. 


An experiment is to be conducted on accelerated failure tests with small electric motors at five dif- 
ferent temperatures. A maximum of five tests can be conducted in one day. There are 20 motors 
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Ly. 


18. 


19. 


available for testing. Design an experiment with a randomized assignment of temperatures to the 
motors. Show a sketch of the final set of tests that are to be conducted. 


You are designing a study to investigate soil-plant relationships in oak—pine mixed forests. The 

factor under study is the percent of oak in the forest mix. You have identified three suitable 

replicate sites for each of the following oak percentages in the forest mix: (1) 0%, (2) 20%-—30%, 

(3) 45%-55%, (4) 70%-80%, and (5) 100%. You must collect soil and forest floor litter samples 

from each site and conduct laboratory chemical analyses of the samples. The labor is to be divided 

equally among three people because of the amount of work involved in the field and the laboratory. 

a. Design the study to control the experimental error variation with the 15 chosen sites using 
yourself and two other people as workers. 7 

b. Sketch the analysis of variance for data from the study. Include the source of variation and de- 
grees of freedom for each sum of squares partition. 

c. Suppose you take two samples from each site. Repeat part (b). 


You are conducting an in-vitro feedstuff digestion trial in flasks that must be inoculated with CO 

and rumen microorganisms obtained from a steer just prior to inoculating the flasks in the lab. 

Oxygen and temperatures less than 37°C adversely affect the microorganisms. Given the time re- 

quired to add the CO, and microorganisms to the flasks even under the best of controlled condi- 

tions the first flasks receive warm healthy microorganisms, but the later flasks receive 
microorganisms with reduced activity. 

a. Suppose you have five treatments and 25 flasks to inoculate in sequence by yourself. Set up a 
complete block design for the study to control variation caused by reduced microorganism 
activity due to exposure. 

b. Sketch the analysis of variance for data from the study. Include the source of variation and de- 
grees of freedom for each sum of squares partition. 

c. Suppose you have 50 flasks available for the five treatments. How could you design the study? 
Repeat part (b). 


You are going to conduct a study to determine the contamination of stream water by human activi- 
ty in a national forest. You have located four streams, each of which has a small permanent com- 
munity located near the stream. The communities each have a waste disposal plant in the watershed 
of the stream. Also, each stream has a large recreational camp site located five to ten miles down- 
stream from the community. 

You want to take a water sample at each of four locations on each stream: a sample upstream 
from the community, a sample one mile below the community, and one sample each from immedi- 
ately above and below the recreational campsite. 

You are also required to take a sample on each of four days of the week: Friday, Sunday, Mon- 
day, and Wednesday. Your resources are limited such that you can only take 4 water samples from 
each stream for a total of 16 water samples for the entire study. 

a. Set up a complete block design to acquire the water samples with “stream location” as the 
treatment factor. | 

b. Sketch the analysis of variance for the data showing source of variation and degrees of 
freedom for each sum of squares partition. 

c. Suppose you take two water samples each time you sample a stream location. Repeat part (b). 


APPENDIX 307 


Selected Latin Squares 


8A Appendix 


4x4 


AG 
uA DS 
LC py AS 


AVEC DD AEC A 8 GC) 
» > A Co 


BAD Cc 
Cc D-B- A 


yD oO PA 
[2 2 eS 


c AD SB 


Pe oA D CB A 


ee eS 


DCA 


5x5 


A.B CC DD -E 
|: oe ee Es eee © 
Ce. Be eS 


A 8 42D 


pe Pe 


po 2 


Doe ft. 


Cn. 2 e 


C2 ne CUB 
Do Er Bb AC 


ie BA 


Dp . F BA 
os Se 


LA BD 


rt BA 


x 2 -& DD & aA Ee © DE 


Aa De Os D E 


B © Ae DOD 
. F 2 A 
moe ep A 


Bat. ef A RP ©£© & A.D 


on sD 
D AB 2. 2 


no 
D2 ot A 
Ea ee BC 


Ss 2 8 CA 


Pe A 


6x6 


2 & DE F 


nwo DS 


ee ee ee 
6 A DE 
2 ke. FA 


pt pf ee 
. 2 Bo ae 


oo a FF § gee 
ee ae ee = ee ee 


Se 2 AY SC 


Ps Bt. 2s 


Ye Be mh ee 


aon Cc. DEF 


> 2. o. De 7 


Oe ee ae, Os eee © 


Sc fA ES 
Ct 2 fA 
D 2 2 oo 


Co) 8 kD. ee 
D2. Be A. 
ee ee ee oe - eS 


E Ab fF fe 


ro 2 8-8 


ee oe ea oe 


308 CHAPTER8 COMPLETE BLOCK DESIGNS 


oe ame | 


Aon 4 et ee ee 
ee ae et ee a FS 
Caf ye 2 AE 


MR By Gap 2 FG 
BE AiG) pe 
€ EF GHD AE 


D+ Geb «a? 2e oA 
Es DaBetl 4A Ff 


D ££ EE & G@ By A 


be DB ee he GF 
r «£2 DA G@ 


PeG,cDeAnO EB 
Coe fc 8 


oo A 2 ee 


8 x 8 


Ap Bo. tae Ce Fr Ge 


A @. 2 2 2 Bo 


5. Go ae eS a OG 
C, Axe iat =F SS 


BA fs ees 
C pS) Ls es 
D2) fe oe 
E Fn ASS ew 
FG 2 & 2 4 2 
G Hw Be Bes 


Db. f.. Gee we BE 
EF H BoP 2 sD 
PD H-AjSs Ss 2 
iy Ey Pb Oe A 
w GG? Fee eB CC OF 


BAe ef PP eS 


9x9 


| 


Ae ee FP Ue 


A 


I 


3 3 ee: SG 
Co Eos ae 


D = Yo ft Apso 
a A Bo Le 
ee © eee A 6b C Pe 
ee gee, See Bee ee 
a = DD EPG 
ee ew eS PCO ee 


| 


I 


I 


oo 


| 


APPENDIX 309 


10 x 10 


J 


I 


a ee ee Fe 
D C PP £& Ff An 
(ee AS 
ee ae ee, © eee | 


Se? ae = ee 


It 


A Bm. < —D 


J 


I 


Pee 


Gee ee Sy ee 


J 


| 


Poh 


ae a EF 
mote A ee 
moe DE Pe Ae 
AE ee ee Fo 


J 


I 


] 


J 


9 Incomplete Block Designs: 


2.1 


310 


An Introduction 


It is sometimes necessary to block experimental units into groups smaller than a 
complete replication of all treatments with a randomized complete block or Latin 
square design as illustrated in Chapter 8. The incomplete block design is utilized to 
decrease experimental error variance and provide more precise comparisons 
among treatments than is possible with a complete block design. A general descrip- 
tion of some major groups of incomplete block designs is presented in this chapter. 
The method of randomization and basic analysis methods are demonstrated for 
balanced and partially balanced incomplete block designs. The efficiency of the de- 
signs is also considered. 


Incomplete Blocks of Treatments to Reduce Block Size 


Experiments can require a reduction in block size for one of several reasons. Com- 
plete block designs can reduce the estimate of experimental error variances, but 
sometimes the reduction is insufficient. Alternatively, the number of treatments 
may be so large as to render a complete block design impractical for reducing 
experimental error variance. Also, the natural grouping of experimental units into 
blocks can result in fewer units per block than required by the number of treatments 
for a complete block design. In the following example, limited numbers of environ- 
mental control chambers prevented the performance of a complete replication of all 
treatments in one run of the available chambers. 


9.1 INCOMPLETE BLOCKS OF TREATMENTS TO REDUCE BLOCK SIZE 311 


Example 9.1 Tomato Seed Germination at Constant High Temperature 


Tomatoes often are produced during the winter months in arid and tropical 
regions. Winter production requires seeding during late summer when soil 
temperatures can exceed 40°C, which surpasses the suggested maximum ger- 
mination temperature of 35°C for tomato seed. 


Research Objective: A plant scientist wanted to determine in what tempera- 
ture range she could expect inhibition of tomato seed germination for a group 
of tomato cultivars. 


Treatment Design: Four temperatures were chosen to represent a tempera- 
ture range common for the cultivation area under consideration. They were 
25°C, 30°C, 35°C, and 40°C. The tomato seed was to be subjected to a con- 
stant temperature in a controlled environment chamber. 


Experiment Design: A single chamber would be an experimental unit since 
true replication of any temperature treatment required a separate run of the 
temperature treatment in a chamber. Any number of factors could contribute 
to variation in response between runs since the entire experimental setup had 
to be repeated for a replicate run. Thus, blocking on runs was considered 
essential. 

One complete block and replication of the experiment required four cham- 
bers; however, only three chambers were at the disposal of the plant scientist. 
Since the natural block of one run had fewer chambers (experimental units) 
than treatments, she constructed an incomplete block design. 

A diagram of the design is shown in Display 9.1. Three different tempera- 
tures were tested in each of the four runs. The runs represent incomplete 
blocks of three temperature treatments. The treatments were randomly as- 
signed to the chambers in each run. Some special features of this design are 
discussed in the next section. 


Display 9.1 An Incomplete Block Design with Four 
Treatments in Blocks of Three Units 


Chamber Chamber 


1 . Ee 1 2 2 


Chamber Chamber 


1 Z 3 1 = 3 


Source: Dr. J. Coons, Department of Botany, Eastern Illinois University. 


312 CHAPTER9 INCOMPLETE BLOCK DESIGNS: AN INTRODUCTION 


Other Examples: Batches of material for industrial research serve as blocks, but in- 
sufficient material may exist in a single batch for all experimental treatments. The 
criteria for matching subjects may result in insufficient numbers of like subjects or 
cohorts in each group to accommodate the treatments planned for the study. Agro- 
nomic variety trials often contain large numbers of varieties for test and complete 
blocks are not practical for reduction of error variance. Incomplete block designs 
are suitable choices for the experiments in each of these examples. 

The guiding principle for block size is to have a homogeneous set of experi- 
mental units for more precise treatment comparisons. Incomplete block designs 
were introduced by Yates (1936a, 1936b) for experiments in which the number of 
experimental units per block are less than the number of treatments. The designs 
were developed from a need for experiments that included the relevant set of treat- 
ments to address the research hypotheses yet were constrained to sensible block 
SIZes. 3 

The incomplete block designs may be classified into two major groups of de- 
signs: those arranged in randomized incomplete blocks with one blocking criterion 
and those with arrangements based on Latin squares with two blocking criteria. The 
designs may be balanced wherein each treatment is paired an equal number of 
times with every other treatment in the same blocks somewhere in the experiment. 
A partially balanced design occurs when different treatment pairs occur in the same 
blocks an unequal number of times or some treatment pairs never occur together in 
the same block. An overview of incomplete block designs will be presented in this 
chapter along with an introduction to the analysis of data from these designs. 


9.2 Balanced Incomplete Block (BIB) Designs — 


The BIB Design Compares All Treatments with Equal Precision 


The balanced incomplete block design is arranged such that all treatments are 
equally replicated and each treatment pair occurs in the same block an equal num- 
ber of times somewhere in the design. The balance obtained from equal occurrence 
of all treatment pairs in the same block results in equal precision for all compari- 
sons between pairs of treatment means. 

The incomplete block design has r replications of ¢ treatments in b blocks of k 
experimental units with k<t. The total number of experimental units is 
N =rt= bk. The design for the tomato experiment in Display 9.1 has b=4 
blocks of k = 3 experimental units. Each of the t = 4 treatments is replicated r = 3 
times. There are a total of N = bk = 4-3 = 12 or N=ri=3 -4= V2 exper 
mental units. Upon inspection it can be seen that each treatment pair occurs to- 
gether in the same block twice. For example, the treatment pair (25°, 30°) occurs in 
blocks 1 and 2 and the treatment pair (30°, 35°) occurs in blocks 2 and 4. 

The number of blocks in which each pair of treatments occurs together is 
\ = r(k — 1)(t — 1), where \ < r < b. The integer value derives from the fact 
that each treatment is paired with the other t— 1 treatments somewhere in the 
design X times. There are A(t{—1) pairs for a particular treatment in the 


9.3 Howto 
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experiment. The same treatment appears in r blocks with k — 1 other treatments, 
and each treatment appears in r(k — 1) pairs. Therefore, 


Mt-—1l)=r(k—-1) or A=r(k—-1)/(t—1) 


For the tomato experiment design in Display 9.1, A = 3(3 — 1)/(4 — 1) =2. 
A balanced incomplete block design can be constructed by assigning the 


appropriate combinations of treatments to each of 6 = i) blocks to achieve a 


balanced design. Frequently, balance is possible with less than ( ) blocks. 


There is no single method for constructing all classes of balanced incomplete 
block designs. Methods do exist for constructing certain classes of incomplete 
block designs. The topic of design construction has been the subject of much math- 
ematical research, resulting in a vast array of balanced and partially balanced in- 
complete block designs. 

Plans for small numbers of treatments are given in Appendix 9A.1. Additional 
tables of designs useful for many practical situations can be found in Cochran and 
Cox (1957) and Fisher and Yates (1963). Several important categories of tradition- 
al balanced incomplete block designs will be illustrated in more detail in Chapter 
10. An introduction to the basic structure of incomplete block designs with illustra- 
tions of their application and analysis is the focus of this chapter. 


Randomize Incomplete Block Designs 


After the basic design has been constructed with the treatment code numbers, the 
steps in randomization follow: 


Step 1. Randomize the arrangement of the blocks of treatment code number 
groups. 


Step 2. Randomize the arrangement of the treatment code numbers within each 
block. 


Step 3. Randomize the assignment of treatments to the treatment code numbers in 
the plan. 


Randomization is illustrated with the basic design plan for t = 4 treatments in 
6 = 4 blocks of k = 3 experimental units each. Prior to randomization the plan is 


Block 
| 


Ny} | —| — 
LI} GI} DO} dO 
eS Fs BS age eS ES 


ws 
3 
4 
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Step 1. Suppose the blocks for the experiment are the runs of three growth cham- 
bers with three temperature treatments used in Example 9.1. The treatment groups 
(1,2, 3), (1,2, 4), (1,3,4), and (2,3,4) must be randomly assigned to the runs. 
Choose a random permutation of the numbers | to 4, and assign the four blocks to 
the four runs. With the random permutation 2,4, 1,3 the assignment is 


Run Original Block 
l l 2 4 2 
2 f/ 3 4 4 
3 l 2 2 l 
4 l 3 4 3 


Step 2. Assign random treatment code numbers to the three growth chambers in 
each run. Choose a random permutation of the numbers | to 4 for each chamber, 
and omit the treatment number absent in the run. Four random permutations along 
with the assignment to each chamber (A, B, C) in each run follow: 


Chamber 
Run A B C Permutation 
| 2 4 | Z 4 3 | 
2 3 4 2: 3 4 yt 2 
3 l 2 3 4 | 2 A 
4 | 4 3 | 4 3 2 


Step 3. Suppose the treatments are temperatures of 25°C, 30°C, 35°C, and 40°C. 
A random permutation 2,4,3,1 gives a random assignment of temperatures to 
treatment code numbers with the replacements 225°C, 430°C, 3 435°C, and 
1 + 40°C in the previous display. 


Chamber 
Run A B C 
iin i O's Ore @ 
37% “30° I 
Wo 25 | oe 
40°C SUC are 


BR WN — 
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9.4 Analysis of BIB Designs 


Statistical Model for BIB Designs 
The linear statistical model for a balanced incomplete block design is 
ij = MET + pj t+ ei; (9.1) 
Te Pe Se oe ee 


where yz is the general mean, 7; is the fixed effect of the ith treatment, p; is the 
fixed effect of the jth block, and the e;; are independent, random experimental 
errors with mean 0 and variance o”. | 

Recall that there are r replications of the ¢ treatments in b incomplete blocks of 
k, experimental units. The total number of observations is N = rt = bk, with each 
treatment par appearing oon in A = r(k — 1)/(t — 1) blocks in the experiment. 

The effects of treatments and blocks are not orthogonal in the incomplete block 
design because allt treatments do. not appear i in each of the blocks. Therefore, , ie 
t nent me stimates of | i; = = u re = “The parameter esti- 
mates aaa ae ae sum of squares for the balanced incomplete block designs can 
be computed with relatively straightforward formulae. A derivation of the least 
Squares estimates for the balanced incomplete block design is given in Appendix 
9A.3. 


Sum of Squares Partitions for BIB Designs 


The sum of squares partitions can be derived by considering alternative full and re- 
duced models for the design. Solutions to the normal equations are obtained for the 
full model, y;; = 4+ 7; + pj + eij;, with estimates, Y,; = +7; +p,;, to compute 
the experimental error sum of squares for the full model, 


SSEs = BE (vis = Fy = BE (vis = p- 7 - py (9.2) 


Solutions to the normal equations for the reduced model, y;; = w + p; + e:;, with 
estimates, y;; = i + ‘P;, are used to compute the experimental error sum of squares 
for the reduced model, 3 


SSE, = BE Wis — 4) = BE (vis — p- py (9.3) 


The difference, SSE, — SS Ey, is the reduction in sum of squares as a result of 
including 7; in the full model. It is the sum of squares due to treatments after block 
effects have been considered in the model. It is referred to as SS Treat- 
ment(adjusted), implying that block effects are also considered when estimating the 
treatment effects in the full model. For balanced incomplete block designs the sum 
of squares for treatments adjusted, SSE, — SS'E;, can be computed directly as 
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t 
ky Qi 
SST (adjusted) = a (9.4) 
with t—1 degrees of freedom. The quantity Q; is an adjusted treatment total 
computed as 


I 
Qi =u. — 7B: (9.5) 


where 6; = — nijy; is the sum of all block totals that include the 7th treatment, 
and n;; = | if treatment i appears in block 7 and n;; = 0 otherwise. This correction 
to the treatment total has the net effect of removing the block effects from the treat- 
ment total. 

The sum of squares for blocks is derived from the reduced model with treat- 
ments ignored in the model as SSB (unadjusted) = S'S Total — SSE,. Treatment 
effects are not considered when estimating the block effects, and the sum of squares 
is called an unadjusted sum of squares. 

The additive partition of S'S Total is 


SS Total = SS B(unadjusted) + SST (adjusted) + SS Ey (9.6) 


The analysis of variance outline for the sum of squares partitions is shown in Table 
9.1. Many computer programs are capable of computing the correct sum of squares 
partitions, least squares estimates of treatment means, contrasts among least squares 
means, and their standard errors for incomplete block designs. 


Table 9.1 Analysis of variance for a balanced incomplete block design 


re ae Reet re ee errno 5 eee oe 

Source of Degrees of Sum of Mean 
Variation Freedom Squares Square 

Total Newd Wineries 

Blocks b— 1 b> O.= yy MS B(unadj.) 

k - 
Treatments t—1 ae MST (adj.) 
Error N-t—6b+1 By subtraction MSE 


Example 9.2 Vinylation of Methyl Glucoside 


The addition of acetylene to methyl glucoside in the presence of a base under 
high pressure had been found to result in the production of several monovinyl 
ethers, a process known as vinylation. The monoviny] ethers are suitable for 
polymerization in many industrial applications. 


Research Objective: The chemists wanted to obtain more specific informa- 
tion about the effect of pressure on percent conversion of methyl glucoside to 
monovinyl isomers. 
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Treatment Design: Based on previous work, pressures were selected within 
the range thought to produce maximum conversion. Five pressures were 
selected to estimate a response equation: 250, 325, 400, 475, and 550 psi. 


Experiment Design: Only three high-pressure chambers were available for 
one run of the experimental conditions. It was necessary to block on runs be- 
cause there could be substantial run-to-run variation produced by new setups 
of the experiment in the high-pressure chambers. The chemists set up a bal- 
anced incomplete block design with ten blocks (runs) each with three experi- 
mental units (pressurized chambers). Three different pressures were used in 
each run. The resulting design had six replications of each pressure treatment. 

The pressures used in each run and the percent conversions to monovinyl 
isomers are shown in Table 9.2. The additive sum of squares partition of 
methyl glucoside is shown in Table 9.3. 


Table 9.2 Percent conversion of methyl glucoside by acetylene under high pressure in 
a balanced incomplete block design 


Pressure (psi) 


Run 250 325 400 475 550 Y5 
1 16 18 ‘< 32 = 66 
7: 19 = = 46 45 110 
3 Lt 26 39 = 61 126 
4 ce ie 21 35 55 111 
5 _ 19 < 47 48 114 
6 20 = 33 31 ss 84 
7 13 13 34 = = 60 
8 21 = 30 se 52 103 
9 24 10 = 50 84 
10 — 24 31 37 = 92 
Yi 113 110 188 228 311 950 

*B; 507 542 576 577 648 

'Q; — 56.0 ~ 70.7 — 4.0 35.7 95.0 


“Example: By =yity2t+yetyrt+ystys 
'Q, =m. — 1B, = 113 — 16607) = — 56.0 | 


Source: Drs. J. Berry and A. Deutschman, University of Arizona. 


Inferences for Treatment Means 


The least squares estimates of the treatment means and their estimated standard 
errors are shown in Table 9.4. The least squares estimate for a treatment mean jy; is 
2; =L+7;, where 
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Table 9.3. Analysis of variance for percent conversion of methyl glucoside in a 
balanced incomplete block design | 


Source of Degrees of Sumof Mean 
Variation Freedom Squares Squares F fro F 
Total 29 5576.67 
Blocks (unadj) 9 1394.67 154.96 
Pressure(adj) 4 3688.58 922.14 29.90 .000 
Error 16 493.42 30.84 

pean =. ee 

w=y and T= ar (9.7) 


For example, from Table 9.2, Q; = — 56.00 and # = y, = 950/30 = 31.67, so 
that 


Pee ee 


= — = — 56.00) = — 11.20 d ji. see = 7120 = 20.47 
T1 WY ays) ) ana ty 0 


Standard Errors for Treatment Means 


The standard error for a treatment mean estimate is 


kr(t — 1) 
rt us Mt )- 


A 95% confidence interval estimate of a treatment mean in Table 9.4 is 
BH; + t.025,16(sq,), where t.o25,16 = 2.120. The standard error of the estimated 
difference between two treatment means, 1; — Lj is 


ee 2kMSE ie (2)(3)30.84 aes 9.9 
iG.-2) = Vira, & Oye (9.9) 


Table 9.4 Least squares estimates of pressure means for percent conversion of 
methyl glucoside in a balanced incomplete block design 


30.84 ( (3)(6)(4) 


O® '* OS® ) =244 eo 


Pressure (psi) Li; Sp, 
250 20.47 2.44 
Ps pee 2.44 
400 30.87 2.44 
475 38.80 2.44 


550 50.67 2.44 
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Tests of Hypotheses About Treatment Means 


The Fo statistic to test the null hypothesis of no differences among the treatment 
means is 


eo MST(adj.) 922.14 _ ae — 
Te eee ee anaes 


with Pr > Ff’ = .000 (Table 9.3). The critical value at the .05 level of significance 
is F'o5,4,16 = 3.01. There are significant differences among the pressures with 
respect to conversion of methyl! glucoside to vinylation products. 


Contrasts Among Treatment Means 


Treatment contrasts are estimated with least squares estimates of the treatment 
means 7, as 


t 
c= Soda, (9.11) 


with standard error estimate 


(9.12) 


The statistic t) = c/s, can be used to test the null hypothesis Hy: C = 0 with a cri- 
tical value based on the Student ¢ statistic with N — t — b + 1 degrees of freedom. 

The | degree of freedom sum of squares for the contrast can be computed with 
the least squares means 7/, as 


_ Medi? 
«kDa? 


The statistic fp = SSC/MSE is used to test the null hypothesis Hp: C = 0 with 
critical value Fa 1 (—t-0+1): 

The pressure treatment for this experiment is a quantitative factor with five lev- 
els. The regression of percent conversion of methyl glucoside on pressure using 
orthogonal polynomial contrasts for pressure will describe the effect of pressure on 
conversion rate. The regression analysis is left as an exercise at the end of the 
chapter. 


SSC (9.13) 


Recovering Treatment Information from Block Comparisons 


The analyses for incomplete block designs illustrated thus far estimate treatment ef- 
fects based on treatment information contained within the blocks; this is referred to 
as intrablock analysis. The incomplete block designs are nonorthogonal because 
not all treatments appear in all blocks, and comparisons among the blocks contain 
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some information about treatment comparisons. Yates (1940a) showed this inter- 
block information can be recovered with an interblock analysis and combined with 
the information from the intrablock analysis. 


Block Contrasts Contain Treatment Contrasts 


The information on treatment comparisons contained in a comparison between 
blocks can be illustrated simply with the first two runs of the chemistry experiment 
in Example 9.2. The data for percent conversion of methyl glucoside in the first 
two runs of the experiment are 


Presssure (psi) 
Run. 250 325 400. AJ3... DOU fae 
l lo - [8 — a2 ae G3 
Z 19 - — 46 45 37 


A contrast between the two runs means, 37 and 22, is also a contrast between two 
of the pressure treatments, 550 psi and 325 psi. Similar treatment comparisons are 
contained in all of the block comparisons. The objective is to recover this between- 
block, or interblock, information about treatments and combine it with the within- 
block, or intrablock, information about treatments. 

The subject of interblock information recovery is mentioned here only to indi- 
cate the availability of the method. A thorough treatment on the recovery of inter- 
block information can be found in Kempthorne (1952), Cochran and Cox (1957), 
John (1971), John (1987), and John and Williams (1995). 

Information from the interblock analysis is incorporated into the intrablock 
analysis with an estimator of the treatment effects that combines the intrablock and 
interblock estimators of the treatment effects. 

If blocking has been very effective in reducing experimental error, the inter- 
block estimate contributes only a small amount of information to the combined esti- 
mate. On the other hand, if the block effects are small the information recovered 
from the interblock estimate can be substantial. If blocking is not effective, then the 
analysis with recovery of interblock information virtually reduces to the ordinary 
analysis with no block adjustments. 


9.5 Row—Column Designs for Two Blocking Criteria 


When the need exists to control variation with more than one blocking criterion the 
Latin square design is a complete block design used to control variation among the 
experimental units with two blocking factors. Latin square designs may be imprac- 
tical in some situations since the number of experimental units required by the de- 
sign, NV = t?, can exceed the constraints of the experimental material or treatment 
numbers can exceed available block sizes. 
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Row-—column designs can be used with either rows or columns or both rows 
and columns as incomplete blocks when two blocking criteria are required for the 
‘experiment. The designs are arranged in p rows and q columns of experimental 
units. Consider the classical experiment to test four automobile tire treads on the 
four positions of four automobiles in a Latin square design. Suppose the research 
team wanted to evaluate t = 7 tire treatments. The requirement to control variation 
due to tire position and automobile is still necessary. However, the automobiles 
only have four positions on which to test the seven treatments. The row—column de- 
sign with an incomplete set of treatments in each column shown in Display 9.2 can 
be used for the experiment. 


Display 9.2 A 4x 7 Row—Column Balanced Incomplete 
Block Design 


Automobile 


Position (1) 


The automobiles are used as incomplete blocks with four treatments evaluated 
on the four tire positions. The positions are complete blocks since each treatment is 
evaluated at each of the positions. Upon close inspection it can be seen that each 
treatment pair occurs on the same automobile two times somewhere in the experi- 
ment. Therefore, the incomplete block design on automobiles is balanced. The de- 
sign is naturally balanced with respect to the positions because they constitute 
complete blocks. The example is clearly a case where an incomplete block design 
was necessary because there was an insufficient number of positions available to 
test all treatments at one time. 


Row Orthogonal Designs Have a Complete Replication in Each Row 


Given the row—column design in Display 9.2 is a complete block design for rows 
and a balanced incomplete block design for columns the design is referred to as a 
row orthogonal design (John, 1987). Since each treatment occurs in each row, the 
treatments are orthogonal to the rows. 

Youden (1937, 1940) developed incomplete Latin square arrangements, now 
known as Youden squares, by omitting two or more rows from the Latin square de- 
sign. The design parameters are t = b, r =k, and \ = k(k — 1)/(t — 1). Plans of 
some Youden squares for small experiments are shown in Appendix 9A.2. Addi- 
tional plans can be found in Cochran and Cox (1957) and Petersen (1985). 

The row orthogonal design in Display 9.2 is a Youden square that has 
r =k =4 rows as replications, b = 7 columns, and t = 7 treatments, with A = 2 


322 CHAPTER9 INCOMPLETE BLOCK DESIGNS: AN INTRODUCTION 


for the incomplete column blocks. The columns are incomplete blocks of 
k =r =4 units, and the rows are complete blocks containing each of the t = 7 
treatments. | 

Randomization in row—column designs is accomplished in the same manner as 
that described for Latin square designs in Chapter 8. There is a separate permuta- 
tion of each of the row and column groups of treatments to the actual blocks, and 
the treatments are randomly assigned to the treatment labels of the design. 


The Analysis Outline for Row—Column Designs 
The linear model for the row—column design is 
Yijm = B+ + Pj + Ym + Cigm (9.14) 
(= I re pe gk MS a 


where pu is the general mean, 7; is the treatment effect, p; is the row effect, 7,, is the 
column effect, and e;,, is the random experimental error. 

Each row contains a complete replication of all of the treatments, and the treat- 
ments are orthogonal to the rows. The columns are also orthogonal to the rows. The 
treatment totals are adjusted only for the incomplete column blocks to provide un- 
biased estimates of treatment means and a valid F' test for treatment effects. The 
intrablock analysis of variance outline is shown in Table 9.5. The analysis of vari- 
ance for the row orthogonal designs only differs from the balanced incomplete 
block designs by the addition of a sum of squares partition for the rows. All other 
aspects of the analysis remain the same. 


Table 9.5 Intrablock analysis for a row—column balanced incomplete block de- 
sign with treatments orthogonal to rows 


Source of Degrees of Sum of 
Variation Freedom Squares 
Total N-1 >, Yiim — 9.) 
Rows (replications) k-1 t>~G@,-— 7.) 
Columns (unadjusted) b—1 ky @Um- yy 
Treatments (adjusted)* i 4 BS OF [Me 

Error (t — 1)(k —1)—-—(b—-1) By subtraction 


*Q; = yi... — (B;/k) where B; is the sum of column block totals that include treatment 7. 


9.6 Reduce Experiment Size with Partially Balanced (PBIB) 
Designs 


Balanced designs cannot be constructed for every experimental situation requiring 
incomplete blocks. In some cases the required number of replications may become 
prohibitive. Frequently, partially balanced designs requiring much less replication 
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can be constructed. The minimum number of replications required for the balanced 
design is r = A(t — 1)/(k — 1). Suppose an experiment with t = 6 treatments re- 
quires blocks of size k = 4. The balanced design shown in Appendix 9A.1 requires 
r = 10 replications or rt = 60 experimental units. It is completely possible that 60 
experimental units are not available or that the cost of the experiment with 60 units 
is prohibitive. 


Unequal Occurrences of Treatment Pairs 


The partially balanced incomplete block design was introduced by Bose and Nair 
(1939). The partially balanced design has some treatment pairs occurring in more 
blocks than other treatment pairs, and consequently some treatment comparisons 
will be made with greater precision than others. It is more straightforward to use a 
balanced design that provides equal precision for all comparisons between treat- 
ments. However, if resources are limited and sufficient replication is possible, the 
partially balanced design is an attractive alternative to the balanced design that re- 
quires excessive numbers of experimental units. 

Consider the partially balanced incomplete block design for six treatments in 
blocks of four units shown in Display 9.3. The balanced design in Appendix 9A.1 
requires ten replications for balance with 4 = 6. The partially balanced design 
shown in Display 9.3 has two replications and requires twelve experimental units in 
three blocks of four experimental units. 


Display 9.3. A Partially Balanced Incomplete Block Design 
with Six Treatments in Blocks of Four Units 


Block 1 


Block 2 
Block 3 


Some pairs of treatments occur together in two blocks, whereas other pairs oc- 
cur together in only one block. Treatment pairs (1,4), (2,5), and (3,6) occur to- 
gether in two blocks. All other treatment pairs occur together in only one block. 
Those treatment pairs occurring together in two blocks will be compared with 
somewhat greater precision than those pairs that occur together in only one block. 
The differing precisions for treatment comparisons is the sacrifice made for the 
smaller experiment. However, the difference in precision is not so great as to inhibit 
the use of partially balanced designs. 

Increasing the replication number can increase the precision on the treatment 
comparisons. Another repetition of the same experiment will provide four replica- 
tions if the two replications provided by the initial experiment are insufficient. If 
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four replications are sufficient, there is still a gain with the partially balanced de- 
sign in terms of reduced costs over the fully balanced design. 


Associate Classes for Treatment Pair Occurrences 


Each treatment is a member of two or more associate classes in a partially balanced 
design. An associate class is a group of treatments wherein each treatment pair 
occurs together in A; blocks. Treatment pairs that occur together in \; blocks are 
known as zth associates. 

The design in Display 9.3 has two associate classes. The treatment pairs 
(1,4), (2,5), and (3, 6) are first associates with A; = 2. Each pair occurs together in 
two blocks. Each treatment has n; = 1 first associates. The remaining treatment 
pairs are second associates with Ay = 1. There are ng = 4 second associates for 
each treatment. For example, the second associates of treatment | are treatments 2, 
3, 5, and 6. They occur with treatment | somewhere in one block of the design. The 
complete sets of associates are shown in Display 9.4. 


Display 9.4 First and Second Associates for Six Treatments 
in the Partially Balanced Design of Display 9.3 


Associates 
Treatment First Second 
l 4 2,35 556 
a 1,3,4,6 
3 6 L243 
4 ] 2,33 558 
5 Z 1, 3,.4,6 
6 3 |, 2,453 


A catalog for some of the major groups of partially balanced designs with 


two associate classes can be found in Bose, Clatworthy, and Shrikhande (1954) and 


Clatworthy (1973). 


Notes on the Analysis of PBIB Designs 


One advantage of the balanced designs over the partially balanced designs is that 
the manual calculations for the analysis of variance are somewhat simpler. Before 
the advent of modern-day statistical programs it was imperative that realistic manu- 
al computational formulae be available for an analysis of the data. Now it is possi- 
ble to compute the appropriate sum of squares partitions for the analysis of variance 
as well as unbiased estimates of treatment means and their standard errors for most 
designs with available statistical programs. Therefore, the partially balanced design 
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is a realistic alternative provided there are reasonably precise comparisons between 
all treatment pairs. 


Sum of Squares Partitions for PBIB Designs 


The linear model for the partially balanced incomplete block design is the same 
as that for the balanced incomplete block design shown in Equation (9.1). Since the 
treatments are not orthogonal to blocks, the orthogonal sum of squares partitions 
are again derived by fitting the full model, y;; = w+ 7; + p; + e:;, to obtain SS Ey 
and the alternative reduced model without treatment effects, y;; = uw +p; +e;;, to 
obtain SSE,. . 

The sum of squares for treatments adjusted for blocks is derived as 
SS Treatment (adjusted) = SSE, — SSE;. The block sum of squares unadjusted 
for treatments is the same as that shown in Table 9.1 for the balanced design. 

The formulae to compute least squares estimates of treatment effects, treatment 
means, and the adjusted treatment sum of squares are not so straightforward as 
those for the balanced designs. Because there is more than one associate class for 
each of the treatments, more complex adjustments must be made to the treatment 
totals. Fortunately, many of the statistical programs available can perform the com- 
putations. Thus, it is not necessary to utilize the cumbersome manual calculation 
formulae for a thorough analysis of the data. Details of manual calculations can be 
found in Cochran and Cox (1957). Detailed derivations can be found in John 
(1971) and John (1987). 


9.7 Efficiency of Incomplete Block Designs 


The efficiency of one design relative to another is measured by comparing the vari- 
ance for the estimates of treatment mean differences in the two designs. For exam- 
ple, the variance of the difference between two treatment means for the randomized 
complete block (RCB) design is 207,,/r. Is that variance less than its counterpart 
from the completely randomized design 202. /r? The relative efficiency of the RCB 
design to the completely randomized design can be determined because an estimate 
of a? for the latter can be obtained from the data in the RCB design. It is then possi- 
ble to evaluate the effectiveness of blocking to reduce experimental error variance. 

No such luxury exists if we want to determine the efficiency of a balanced 
incomplete block (BIB) design relative to the RCB design. It is not possible to 
compute an estimate of o? for the RCB design from the data in the BIB design. We 
would like to determine whether the smaller block size of the BIB design resulted 
in a smaller experimental error variance. The ratio of variances for a difference be- 
tween two treatment means is still used to compare the BIB design with the RCB 
design. The difference is that it measures only the potential efficiency of the BIB 
design since we are unable to estimate o” for the RCB design. 
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The Efficiency Factor for Incomplete Block Designs 


The variance of the difference between two treatment means in the BIB design is 
2ko;,,/At. If the BIB design and the RCB design have the same number of treat- 
ments and replications the efficiency of the BIB design relative to the RCB design 
is the ratio of the variances 

Cs me ae 


Effici = — 
he a Ode 


(9.15) 


The quantity EF = \t/rk is called the efficiency factor for the balanced incom- 
plete block design. It provides an indication of the loss caused by using an incom- 
plete block design without reducing o?. The BIB design is more precise for the 
comparison of two treatment means than the RCB design if 


2 

O bib At 
2 

Orch rk 


(9.16) 


The efficiency factor is a lower limit to the efficiency of the BIB design rela- 
tive to the RCB design for an experiment with the same number of replications and 
the same error variance o?. Given a BIB design with t = 4,r = 3,k = 3, and \ = 2 
and an RCB design with t = 4 and r = 3, both designs require the same number of 
experimental units but they have different blocking arrangements. The BIB design 
is more precise than the RCB design if 


In other words, of, would have to be about 11% smaller than o2,, for the BIB de- 
sign to be as precise as the RCB design with the same number of replications. 

The intent of an incomplete block design is to reduce the error variance and in- 
crease the precision of the comparisons between treatment means. The goal would 
be to reduce o? for the BIB design so that the inequality shown in Equation (9.16) 
would be achieved. A successful incomplete block design will reduce the error 
variance, and o},, will be smaller than c2,,. Some elements that enter into a success- 


ful blocking strategy are considered in the next chapter. 


EXERCISES FOR CHAPTER 9 


1. A horticulturalist studied the germination of tomato seed with four different temperatures (25°C, 
30°C, 35°C, and 40°C) in a balanced incomplete block design because there were only two growth 
chambers available for the study. The experiment was conducted in a balanced incomplete block 
design. Each run of the experiment was an incomplete block consisting of the two growth cham- 
bers as the experimental units (k = 2). Two experimental temperatures were randomly assigned to 
the chambers for each run. The data that follow are germination rates of the tomato seed. 
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28.90 
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24.38 
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S625 86oC: 
—— 1.34 
— 2.24 

5.90 1.83 
18.27 —— 
842 — 


Source: Dr. J. Coons, Department of Botany, Eastern Illinois 
University. 
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How many times did each treatment pair occur together in the same block? 
What is the efficiency factor for this design? 
Write a linear model for the experiment, explain the terms, compute the intrablock analysis of 
variance, and test the null hypothesis for temperature effects. 
Compute the least squares estimates of the temperature means and their standard error. 

Compute the standard error of the difference between two least squares estimates of 


temperature means. 


Compute the | degree of freedom sum of squares partitions for linear and quadratic orthogonal 


polynomial regression, and test their null hypotheses. 


Are the deviations from quadratic regression significant? 


company is developing an insulating compound. An accelerated life testing procedure is 
conducted to determine the breakdown time in minutes after subjection to elevated voltages. Four 
voltages could be tested on a given run of the test. A balanced incomplete block design was used to 
test the compound with t = 7 voltages in b = 7 runs of k = 4 tests. The minutes to compound 
breakdown were recorded for each test. 


“YANN HW WN 


24 
220.22 
270.85 
360.14 


300.66 


200.67 
170.52 
220.12 


190.78 


Voltage (ky) 
32 36 
38.19 — 

— 7.66 
45.43 — 
$6.74 “922 
55.34 10.41 

— 8.74 


PED 
6.92 


22k 


How many times did each treatment pair occur together in the same block? 
What is the efficiency factor for this design? 
Write a linear model for the experiment, explain the terms, and compute the intrablock 
analysis of variance. 
Compute the least squares residuals for each observation, and analyze the residuals according 
to the procedures discussed in Chapter 4. A typical transformation utilized for data of this 
nature is the natural logarithm. Does that seem appropriate here? 
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e. Transform the data, compute the additive sum of squares partitions, and summarize in an 
analysis of variance table. Describe how the transformation has changed the nature of the 
residuals. 

-f. Compute the least squares estimates of the voltage means and their standard error. 
Compute the standard error of the difference between two least squares estimates of voltage 
means. 

h. Compute the 1 degree of freedom sum of squares partition for linear regression of time to 
breakdown (or a suitable transformation) on voltage, and test the null hypotheses. 

i. Are the deviations from linear regression significant? 


3. | A study was conducted to evaluate a method to measure traffic delay at urban street intersections in 
seven cities. Seven types of intersections were chosen for the study based on their geometry and 
traffic light configurations. Four of the intersection types were measured in each city in an incom- 
plete block design. Four observers were used to measure traffic delay, one at each of the intersec- 
tions for a specified period of time during peak traffic loads. The design was laid out as a Youden 
square with t = 7 intersection type treatments. The row—column blocking consisted of k = 4 
observers as rows and b = 7 cities as columns. The order in which the cities were visited and the 
sequence of intersection type treatments each of the observers measured were randomized 
according to the Latin square procedure. The intersection types were randomly assigned to the 
treatment labels of the basic Youden square and are shown in parentheses in the data table for time 
in queue (seconds per vehicle). 


City 

Observer 1 Z 3 4 ) 6 7 
ae ay Cy de i) 
45.8. 664 270 -926 =] 346 441 

2 GQ) (6). te tf) OD 
28.6 94.6 -64.6--39-9: 34.5 ° 45.7 ~ 2377 

Bo oe eo (eer a) re) 
323-407-832 6-—29:3 47.6 312-744 

4 (3) (). age te A) 3 Oh 
260 311 25 4. 667 66.) 364 


a. Write a linear model for this study, describe the terms, and conduct the intrablock analysis of 
variance for this experiment. 

b. Compute the standard error of the difference between least squares means of two intersection 
means. 

c. What is the efficiency factor for this design? 

d. Use the Multiple Comparisons with the Best procedure to select the set of intersection types 
with the minimum delay. 


4. (Note: This exercise presumes you have available a program to compute the analysis of variance, 
least squares estimates of treatment means with their standard errors, and contrasts with their 
standard errors.) A study was conducted in a partially balanced incomplete block design to evaluate 
t = 9 feed rations on nitrogen balance in ruminants. The study required expensive digestion stalls 


5. 
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and equipment, and only three such stalls were available for the study. The design. consisted of 
r = 3 replications of the 9 treatments in b = 9 blocks of k = 3 animals. There were n; = 6 first 
associates for each treatment with A; = 1 and ny = 2 second associates for each treatment with 
\2 = 0. The observed responses are shown in the table with treatment numbers in parentheses. 


Block 
1 33.7241) 37.80 (2) A225 (3) 
2 38.58 (1) 45.39 (4) 47.75 (6) 
3 34.55 (1) 34.82 (5) 38.29 (7) 
4 42.95 (2) 45.35 (4) 48.84 (9) 
5 451242) 36.36 (5) 40.58 (8) 
6 43.04 (3) 45.22 (6) 37.26 (8) 
7 40.64 (3) 30.49 (7) 36.34 (9) 
8 38.53 (4) 34.58 (7) 40.81 (8) 
9 36.40 (5) 33.28 (6) 38.46 (9) 


Source: J. L. Gill (1978), Design and analysis of experiments in the animal and 
medical sciences, Vol 2. Ames, Iowa: lowa State University Press. 


a. Make a table of first and second associates for each treatment. 
Write a linear model for the experiment, and explain the terms. 

c. Use your available computer program to compute the analysis of variance to obtain the 
adjusted treatment mean square and the experimental error variance. Test the null hypothesis 
for rations. 

Compute the least squares estimates of the ration means and their standard errors. 

e. Compute the standard error of the difference between two least squares estimates of ration 
means for first associates and second associates. 

f. Frequently, a weighted average standard error is utilized for all comparisons among two treat- 
ment means in a partially balanced design. With n, = 6 first associates and ng = 2 second 
associates, the weighted average variance of the difference between two treatment means for 
this study is 

. ns; + n285 
everarevaraee = 
TL == 19 

where s? and s3 are the estimated variances of the differences between first and second asso- 

ciates, respectively. Compute a weighted average standard error of the difference for this 

experiment. 


An incomplete block design consists of the following arrangement of blocks (1, 2, 3, 4, 5) and 
treatments (A, B, C, D, E). 

Block ] (DAD) 
2 (A, B, D, E) 
3 (A, C, D, E) 
4 (AV BCD) 
5 (A,B; C32) 
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a. What are the design parameters t,r,k, and b? 
b. Verify that the design is balanced. 


6. An incomplete block design consists of the following arrangement of blocks 1 through 6 and treat- 
ments (A, B, C, D, E, F). 


Block 1 (A,B,C) Block 4  (A,B,D) 
25 {(C,Dubp ee 
S (BRR GO: (EF) 


a. What are the design parameters t,r,k, and b? 
b. Is the design balanced? Explain. 


7. Suppose you are to test £ = 6 automobile fuels, and you have k = 3 engines available for the tests. 
It is therefore necessary to use an incomplete block design for the tests with runs of the three 
engines as blocks. Find the appropriate balanced incomplete block design in Appendix 9A.1 to 
conduct the experiment. Randomize the treatment groups to blocks of runs and automobile fuel 
treatments to the engines. 


8. A paired comparisons study is to be conducted to evaluate campsites in a national park. There are 
t = 10 campsite designs, which range from primitive to full-facility sites. Visitors to the national 
park will be selected at random during the month of June and shown photographs of k = 2 of the 
ten campsites. They are to indicate their preference for one of the two campsites based on the 
photographs. Construct a balanced incomplete block design for the study. How many visitors will 
be required for the study? How many replications of each campsite will there be in the study? How 
many times will each pair of campsites be viewed by a visitor? How would the study design change 
if each visitor was asked to view k = 3 campsites? 


9A.1 Appendix: Selected Balanced Incomplete Block Designs 


Plan 9A.1 t=5,k=3\r=6 = 10, \ =3, E = 0.83 
(1,2,3), (1,2,5), (1,4,5), (2,3,4), (3,4,5) reps 1,2,3 
(1,2,4), (1,3,4), (1,3,5), (2,3,5), (2,4,5) reps 4,5,6 


Plan 9A.2 t=6,k =3,r=5,b=10,\ =2, FE = 0.80 
(1,2;5),( 1,250), 3,4), (2 A, 
(2,3,4), (2,3,5), (2,4,6), (3,5,6), (4,5,6) 


Plan 9A.3 t=6,4 =4F =] b= 15..=6,8 =650 
(1,2,3,4), (1,4,536),, (2,3:5,8) reps 1,2 
(1,2,3,5), (1,2,4,6), (3,4,5,6) reps 3,4 
(1,2,3,6), (1,3,4,5), (2,4,5,6) reps 5,6 
(1,2,4,5), (1,3,5,6), (2,3,4,6) reps 7,8 
(1,2,5,6), (1,3,4,6), (2,3,4,5) reps 9,10 
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Plan9A.4 t=7,k =3,r=3,b=7,\=1,E=0.78 
(1,2,4), (2,3,5), (3,4,6), (4,5,7), (,5,6), 
(26,7), (1,3,1) 


Plan 9A.5 te Jibea so 243056 2 Ace 088 
Replace each block in Plan 9A.4 by a block containing the remaining treatments. 


Plan 9A.6 t=285% e493 7,6 =14 A= 3; bee 0.86 
(1,2,3,4); G67 8) rep 1 (1,2,7,8), (3,4,5,6) rep 2 
(1,3,6,8), @.4,5,7) rep 3 (1,4,6,7), (2,3,5,8) rep 4 
(1,2,5,6), (3,4,7,8) rep 5 (1,3,5,7), (2,4,6,8) rep 6 
(1,4,5,8), (2,3,6,7) rep 7 


Plan 9A.7 t=9,k =3,r=4,b=12,\=1,E =0.75 
(1,2,3), (1,4,7), (1,5,9), (1,6,8), (2,4,9), (2,5,8) 
(2,6,7), (3,4,8), (3,557), (3,6,9), (4,5,6), (7,8,9) 


Plan 9A.8 t=O 4 = 47 = 8, 0= 16,1 =3,F S084 
(1,2,3,4), (1,3,8,9), (1,4,6,7), (1,5,7,8), (2,3,6,7), 

(2,4,5,8), (2,6,8,9), (3,5,7,9), (4,5,6,9), reps 1,2,3,4 
(1,2.4,9),(1,2,5,3)44,3,6,8),-01,5,6,9),42;3,550), 

(2,7,8,9), (3,4,5,8), (3,4,7,9), (4,6,7,8), reps 5,6,7,8 


Plan 9A.9 fea k=s t=] ie = BA = 5, et = 6.90 
Replace each block of Plan 9A.8 by a block containing the remaining treatments. 


Plan 9A.10 129 k=6,7 = $,6 = 12, A= 5, 2 = 0.94 
Replace each block of Plan 9A.7 by a block containing the remaining treatments. 


Plan 9A.11 t= 10,k =3,r=9,b = 30,A =2, E = 0.74 
(1,2,3), (1,4,6), (1,7,9), (2,5,8), (2,8, 10) 

(3,4,7), (3,9,10), (4,6,9), (5,6,10), (5,7,8) reps 1,2,3 

(1,2,4), (1,5,7), (1,8,10), (2,3,6), (2,5,9), 

(3,4,8), (3,7,10), (4,5,9), (6,7,10), (6,8,9) reps 4,5,6 

(1,3,5), (1,6,8), (1,9,10), (2,4,10), (2,6,7), 

(2,7,9), (3,5,6), (3,8,9), (4,5,10), (4,7,8) reps 7,8,9 


Plan9A12  t=10,k =4,r=6,b=15,\ =2,E =0.83 
(1,2,3,4), (1,2,5,6), (1,3,7,8), (1,4,9,10), (1,5,7,9), 

(1,6,8,10), (2,3,6,9), (2,4,7,10), (2,5,8,10), (2,7,8,9), 

(3,5,9,10), (3,6,7,10), (3,4,5,8), (4,5,6,7), (4,6,8,9) 


Plan 9A.13 t= 10, k ea¢ 8.46 = 18,A = 4,2 = 0289 
(1,2,3,4,5), (1,2,3,6,7), (1,2,4,6,9), (1,2,5,7,8), (1,3,6,8,9), (1,3,7,8, 10), 
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(1,4,5,6,10), (1,4,8,9,10), (1,5,7,9,10), (2,3,4,8,10), (2,3,5,9,10), (2,4,7,8,9), 
(2,5,6,8,10), (2,6,7,9,10), (3,4,6,7,10), (3,4,5,7,9), (3,5,6,8,9), (4,5,6,7,8) 


Plan 9A.14 tel £S 6 re 9b =| 15,\ = 5, FH 093 

Replace each block in Plan 9A.12 by a block containing the remaining treatments. 
Plan 9A.15 t= lLekes,r=5,6=1,A22,8 = 0.88 

(1,2,3,5,8), (1,2,4,7,11), (1,3,6,10,11), (1,4,8,9,10), 

(1,5,6,7,9), 2,3,4;6;9};(2,5,9;40,11), (2,67,840), 

(3,4,5,7,10), (3,7,8,9,11), (4,5,6,8,11) | 


Plan 9A.16 te llk=6,r=6, 021154 =3,0 =02 
Replace each block in Plan 9A.15 by a block containing the remaining treatments. 


9A.2 Appendix: Selected Incomplete Latin Square Designs 


Table 9A _ Plans for incomplete Latin square designs derived from complete Latin 


squares 

i k; ¢ b nN E Plan 
4 3 3 4 2 89 : 

3 6 8 4 89 = 

3 9 12 6 89 sg 
5 4 4 5 3 94 6s 

4 8 10 6 94 a 
6. 5 5 6 4 .96 és 

5 10 12 8 .96 — 
7 6 6 7 5 97 4 
8 7 Sj 8 6 98 . 
9 8 8 9 7 98 . 
10 9 9 10 8 99 . 
11 10 10 1] 9 99 * 


“Constructed from at x t Latin square by omission of the last column 
™ By repetition of the plan for r = t — 1, which is constructed by taking a 
t x t Latin square and omitting the last column 


Plans for other incomplete Latin square designs: 


Plan 9B.1 tS 5, Ge 2, P= 4y be 108%. 4. 63 
Reps I and II ~ Reps IIT and IV 
py OSes bo Ae aes 


tee es Se be ee ee S| 
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Plan 9B.2 eS a ie S10, A = 33h S85 
Reps I, II, and III Reps IV, V, and VI 
eS ee oe ee Ley oe 5S 
a ee a 20 3 es I 
ee ae ee oe ae Me 2 3 
Plan 9B.3 7, 8 eG 2 ee SA 8, = SS 
Reps I and IT Reps III and IV Reps V and VI 
2 a a ee | oe ee, ee ek ee oe ee 
2 ee oe 3 3° @ BG FA a2 h 3 7 
Plan 9B.4 ree he 3. fas, be 7A 1 ee 
7 + ae tS 
te oe Ss OT 
2 Se ew fF 2 
Plan 9B.5 = ThE Hee as E = 88 
oe oe ee ae 
ite Tt 2-3 4 
G> FF 25 3. 4 35 
ape ee Cae ee. ee ee 
Plan 9B.6 b= Wh 4 2-4 5-36, d= 1; B= 56 
Reps I and II Reps V and VI 
Pots eS Se 6 oy Se ge oe Be 
pe eg ae See Bee Rees eee ae fof ee OT eS 
Reps IIT and IV Reps VII and VIII 
a ae Be ee ee Se hg ee BB 
Ce ee a ee ee ee oe ee 3 8. 2 Td 
Plan 9B.7 f=9 b= 4, res, b= 16, A = 3, 2 = A 
Reps I, Il, III, and IV Reps V, VI, VII, and VIII 
Loo 24a oe fe fo Fe a ee 
oe ee eae 28 a Oe oe Oe 
wee We: Paes Pa eee ee Sa Vea | ae See ea. See ae ees 
oe ee ee ee ee ee 5 ee ee See. et ee ee ee 
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Plan 9B.8 


t=9,k=5,r=10, b=18, \=5, E=.90 


Reps I Il, Ill, IV, and V 


2 


co 4 WN 
—_— Lf OC DN 


Plan 9B.9 


Reps I, Il, and III 


3 


N ON © 


4 


NN OO W 


hwWwW~)— MN 


6 


NON f 


Ww A — © +] 


oe HN CO 


nN CO A ~ 


Reps VI, VII, VII, IX, and X 


WOmnmwWn 


Ps 


Qo -_ 


2 


ONO fp — NN 


4 


COON W 


5 


-&N OO ~ 


6 


mM W ~)I 


a 


m OO CO f 


t=10,k=3,r=9, b=30,A = 2, E=.74 


Reps IV, V, and VI 


Reps VII, VIII, and LX 


Plan 9B.10 


t=-11,k=2,r=10,b=55,r’ = 


Reps I and II 


Reps Ill and IV 


Reps V and VI 


Reps VII and VIII 


Reps LX and X 


N 


5 


nn 


— 


~] 


oOo 


10 


ON 


—_" 


NO 


ioe) 


oo 


Be 


— 


7 


\O 


Ww HO & OO © 


So ON ee OND 


l 
4 
6 
8 
l 
3 
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Plan 9B.11 t=11,k=5,r=5,b=11, \ =2, E=.88 
Ls Tee F140 ue <2 
Fink 69 41 +7 a6 
34:61) oF @5 32 <4 
AcJ0 6 14 38 03 gil 
Se 312 FA cf 4a AD 
Plan 9B.12 t=11,k=6,r=6,b=11,\ = 3, E=.92 
6 Se A ee 
. a 3. eee) 
fy A ee ee 3 
ee ee 
{0c Dele & 4659 28 
ike Sel & 61S of 
Plan 9B.13 


Reps I, II, and III 1a ee: a Se 
Sw Sac he 7. & 


Reps IV, V, and VI 2 ee 
Ger Je 8 Bare 11 
So Garde: Ge 96 30 


8 
9 
1] 


io — 


8 
12 
Et 


9 
10 
12 


13 
12 


10 
1] 
13 


11 


10 
l 
13 


11 
LZ 
l 


Plan 9B.14 t=13,k=4,r=4,b=13,\=1, E=.81 
4 3 3 ae 
Pe a eer Se 
eS tee ee ee 
. 6 11-2 ee eS 3 


4 


5 


6 


it 


t=13,k=3,r=6,b=26,\ = 1, E=.72 


12243 
Poe 
7 

Sees 
4-5 
3. 4 
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Plan 9B.15 t=13,k=9,r=9, b=13,\=6, E=.96 
9 53. 84 UE 3G 87 6 68 ak es 
§ 6-59-15 95 S55 Be A 
S Be oe fo te 0 8 is se eS 
7 4% 46 BO 9 7 3. CG 
ee ee eee ee ee 
iO 1 we eS a ee 
ib) IS SP HL eos le 5 eee 8 10 
ie ig ee ee ee 
1 7 Oe a 5 6 7 8 ee 
Plan 9B.16 t=15,k=7,r=7, b=15, \=3, E=.92 
ia Xs 85 210 © eee ie 10 CG 
Gae G01 eR ee os 4 i 
DO We Wo 6 6283 124 5 Mee] is 
6 . Po SS = ee es is 1b ees ee 14 
7 8 2 es se 
fo * 4 8 ay 6's? 
eo @ G14 te S73 2-13 24s 
Plan 9B.17 t=15,k=8,r=8,b=15,A=4, E=.94 
pis all -ottys® 3) a 18 fo Se BT G43 
giicqgQhge 9045 ee Ge ii 1 3 6 G5 
eS 1 te ee 8 Fe Sg 
Sage a MP ee a Fg St 6 
ee ee ae ee ee 
3 Y ee SO ee GG 
(A 56 8 7 a Sis ee 
i ee 


9A.3 Appendix: Least Squares Estimates for BIB Designs 


The linear model for the balanced incomplete block design is 


Yigj = LTH + Pj + Ci; (9A.1) 


ee ee 


where su is the general mean, 7; is the treatment effect, p; is the block effect, and 


the e;; are random, independent experimental errors with mean 0 and variance le 
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Let n;; be an indicator variable with n;; = 1 if the ith treatment is in the jth block 
and n;; = 0 otherwise. 


The least squares estimates for the parameters in the full model minimize the 
sum of squares for experimental error: 


¢ : t b 
Q= > > Mise; - De De Nis (Yij — Ti — psy” (9A.2) 


The normal equations for the balanced incomplete block design are 


t b 
we NBt+r> Fitk) 3 =y. (9A.3) 
i=1 j=1 
b 
ih en yay 8S (9A.4) 
j=1 
t 
a tit me Pee Hy 5 = 1,2. ,b (9A.5) 


ii 


The set of t equations for the 7; and the set of b equations for the p; each sum to the 
equation for jz, resulting in two linear dependencies among the equations. The con- 
straints ;7; = 0 and 4p; = 0 can be used to provide a unique solution. 

After the constraints are imposed, the estimate of ys is 7 = y,,/N from Equa- 
tion (9A.3). The equations for p;, Equations (9A.5), are used to eliminate the P; 
from the equations for the 7; in Equations (9A.4). After elimination the equations 
with7, are 


b t b 
rk7; = ‘T; =e 3 a Migs ty = ky;. — = 59 (9A.6) 
j=1 p=l j=1 
pAt 


The right-hand side of Equation (9A.6) is kQ;, where Q; is the adjusted 
treatment total used to compute the adjusted treatment sum of squares. Since 


b 
> tyne = a tor py 4 (9A.7) 
j=l 
and né. = np; (Np; = 0 or 1), Equation (9A.6) can be written as 


t 
r(k—lti—-AY Fp = kQ; (9A.8) 
pl 
pF 
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Since condition {;7; = 0 has been imposed on the solution, the substitution 


can be made in Equation (9A.8). With the equality A(t — 1) = r(k — 1), the equa- 
tion for 7; is 


M7; = kQ; (9A.9) 


10 Incomplete Block Designs: 
Resolvable and Cyclic Designs 


A general description of and analyses for incomplete block designs were introduced 
in Chapter 9. Several major classes of useful incomplete block designs illustrated in 
this chapter include resolvable designs with blocks grouped in complete replications 
of treatments. Cyclic designs that can be constructed without the use of extensive 
tabled plans and the a designs that extend the number of available resolvable de- 
Signs for experiments are also discussed. 


10.1 Resolvable Designs to Help Manage the Experiment 


Resolvable designs have blocks grouped such that each group of blocks constitutes 
one complete replication of the treatments. The grouping into complete replications 
is useful in the management of an experiment. 

One of the early applications for resolvable designs occurred with plant- 
breeding trials placed in field plots on experimental farmland. Plant breeders 
wanted to test a large number of genetic lines and to make all comparisons among 
pairs of lines with equal precision. Arrangement of field plots into blocks smaller 
than a complete replication was necessary to reduce experimental error variance 
more so than was possible with the complete block designs. The resolvable incom- 
plete block design was attractive because it not only reduced the block sizes for 
greater precision, but it also allowed the researcher to manage these large studies in 
the field on a replication-by-replication basis. 

The resolvable designs can be useful in practice when the entire experiment 
cannot be completed at one time. The experiment in a resolvable design can be 
conducted in stages, with one or more replications completed at each stage. In 
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addition, should the experiment for any reason be terminated prematurely there will 
be equal replication of all treatments. 

The resolvable designs are arranged in r replicate groups of s blocks with k 
units per block. The number of treatments is a multiple of the number of 
units per block, t = sk, and the total number of blocks satisfies the relationship 
b=rs >t+r-— 1 for balanced resolvable incomplete block designs. 


Example 10.1 A Resolvable Design for Food Product Tests 


Consider an experiment to test nine variations of a food product on the same 
day with human subjects as judges. Several factors associated with this 
experiment make an incomplete block design attractive. First, a judge can be 
expected to adequately discriminate at most four or five food product samples 
at any one time. Due to scheduling problems, it is not possible to have a suffi- 
cient number of judges available on any one day for adequate replication of 
the treatments. New food preparations must be made each day of the test, and 
it is necessary to ensure day-to-day variation in food preparation does not 
interfere with treatment comparisons in the experiment. 

A test designed for three judges to each evaluate three of the food product 
variations on a given day is shown in Display 10.1. Three judges are sched- 
uled on each of four test days. The three judges each constitute one incom- 
plete block of three treatments. Each judge is randomly assigned to one of the 
blocks of three treatments and randomly presented with three food products 
for evaluation. 

The design is resolvable with one complete replication of the experiment 
conducted on any given day. The design is balanced since each treatment 
occurs with every other treatment one time in the same block somewhere in 
the experiment. The judges only evaluate three food products at one time, 
reducing the within-block variation considerably from what would have 
resulted if a judge been required to evaluate nine products at one time. 


Display 10.1 A Resolvable Balanced Incomplete Block 
Design to Evaluate Food Products 


Day I Day II Day IIT Day IV 


Judge Judge Judge Judge 
1 (12 3) 4 (1, 4, 7) 7 (Lh ee 
2 (4, 5, 6) 5 (2, 5, 8) 8 (2,6. tl 12, 4,9) 
3 ii o.9) 6 (3, 6, 9) 9 (ame i 3, 57) 
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Balanced Lattice Designs 


The balanced lattice designs are a well-known group of resolvable designs intro- 
duced by Yates (1936b). The number of these square lattice designs available is 
limited because the number of treatments must be an exact square t = k?. The 
designs require r = (k + 1) replications and b = k(k + 1) blocks for complete 
balance with 4 = 1. Each replicate group contains s = k blocks of k experimental 
units each. The number of units per block & must be a prime number or a power of 
a prime number. Plans for balanced lattice designs with t = 9, 16,25, 49, 64, and 81 
can be found in Cochran and Cox (1957) or Petersen (1985). 

The design in Display 10.1 is a balanced lattice with nine treatments in blocks 
of three units. The design has four replication groups for a total of 12 blocks. Note 
that each treatment pair occurs together in the same block \ = | time somewhere in 
the balanced design. 

Complete balance requires (k + 1) replications with the lattice designs. Since 
they are resolvable designs, one or more of the replicate groups can be eliminated 
to produce a partially balanced lattice design. Designs with r = 2, 3, or 4 replicate 
groups are known as simple, triple, or quadruple lattices, respectively. 

The average efficiency factor for a balanced lattice design is 


— (k+I(r-1) 
<= iat) 


and for simple and triple lattices the average efficiency factors are, respectively, 
(k + 1)/(k + 3) and (2k + 2)/(2k + 5). 


Rectangular Lattice Designs. 


The restriction on numbers of treatments or varieties with square lattices is quite 
severe. The rectangular lattice designs developed by Harshbarger (1949) pro- 
vided resolvable designs with treatment numbers intermediate to those provided by 
the square lattice designs. 

The rectangular lattice designs accommodate t = s(s — 1) treatments in blocks 
of size k = (s — 1) units. The treatment numbers fall about midway between those 
provided by the square lattices. Cochran and Cox (1957) have tabled plans for rec- 
tangular lattices with ¢ = 12, 20, 30, 42, 56, 72, and 90 treatments. Designs with 
r= 2 and r =3 replicates are known as simple and triple rectangular lattices, 
respectively. ; 

Other resolvable balanced incomplete block designs exist for numbers of treat- 
ments without the restrictions of the square or rectangular lattices; however, they 
exist for a limited set of block sizes and treatment numbers. Some balanced resolv- 
able designs for ¢ < 11 treatments are cataloged in Appendix 9A.1. Other balanced 
resolvable designs for t > 11 are given by Cochran and Cox (1957). 
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10.2 


INCOMPLETE BLOCK DESIGNS: RESOLVABLE AND CYCLIC DESIGNS 


Analysis Outline for Resolvable Designs 
The linear model for the resolvable design is 


Yijm = LT +5 + Pm(j) + Cig (10.1) 


te=cigeiaaintin: FE Ao mes Eek, s 


where jy is the general mean, 7; is the treatment effect, ;; is the replicate group 
effect, p,,(;) 1s the block nested within replication effect, and e€;j, is the random 
experimental error. 

The sequential fit of alternative models outlined in Chapter 9 provides an 
orthogonal sum of squares partition. An additional sum of squares partition is re- 
quired for replicate groups. The blocks are nested within replicate groups and the 
sums of squares for blocks from the replicate groups are pooled as 
SS(Blocks/Reps). The intrablock analysis of variance outline is shown in Table 
10.1. 


Table 10.1 Intrablock analysis for a resolvable balanced incomplete block design 
Source of Degrees of Sum of 
Variation Freedom Squares 
Total Net ee 
Replications r—1 sk) @;- 9.) 
Blocks (unadjusted) within replications r(s — 1) Rg. Te yy 
Treatments (adjusted) $4 i O° 

At 
Error N-t-rs+1 By subtraction 


Q; = yi, — (Bi/k), and B; is the sum of block totals that include the ith treatment. 


The treatments are orthogonal to the replicate groups since each treatment ap- 
pears one time in each of the replicate groups. The treatment totals adjusted only 
for block effects provide unbiased estimates of the treatment means and a valid F’ 
test for treatment effects. 


Resolvable Row-—Column Designs for Two Blocking 


Criteria 


Some research settings require two blocking criteria and do not permit complete 
blocks of treatments for either the row or column blocks required by the row 
orthogonal Youden squares. Row—column designs have the ¢ treatments placed into 
blocks of k = pq units. The treatments in each block are then arranged in a p x q 
row—column design. 
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A limited number of balanced row—column incomplete block designs exists for 
experiments that require block sizes of both rows and columns to be less than the 
number of treatments. The designs require a large number of blocks to meet the re- 
quirement of balance. See John (1971) for an extended bibliography of designs. 


Example 10.2 A Nested Row—Column Design to Sample Insect 
Populations 


Consider an epidemiological study on the transmission of a microbial parasite 
to humans by an insect in an agricultural area of the tropics. The microbial 
parasite is passed to the human when the insect bites the human. In turn, if the 
insect is not infected by the parasite it may become infected when it bites an 
infected human. A public health project is planning to test the effectiveness of 
a drug that is known to disrupt the parasite’s life cycle in humans. An ento- 
mologist working with the project is going to sample the insect populations to 
monitor the effect the program has on reducing their infection rate. 

Nine types of habitats in and around each of four agricultural plantations 
are scheduled for monitoring in the project. The habitats consist of locations 
such as flowing streams, villages on the plantation, the agricultural fields, 
stagnant water ponds, and so forth. The entomologist can only collect insect 
samples at three such habitat sites in a single day. A row—column sampling 
design is set up to control the potential variation caused by sampling on dif- 
ferent days and different times of day at each of four plantations. The design 
is shown in Display 10.2. The design has nine habitat treatments sampled in a 
3 x 3 row—column design at four plantations. The three rows in the 3 x 3 ar- 
ray are the time of day the samples are collected, and the three columns are 
the three days for collection at each plantation. 

Upon inspection of each habitat treatment pair it can be verified that each 
is sampled once at the same time and once on the same day in the design. 
Each treatment pair must occur together one time in the same row and one 
time in the same column somewhere in the design to have a balanced row— 
column design. The design is a resolvable design with plantations as replica- 
tions and one complete replication of the nine habitat types at each plantation. 
It is a nested design wherein the rows and columns are nested within the 
replicate groups. 


Balanced Lattice Square Designs 3 

These designs, introduced by Yates (1940b), are resolvable nested row—column de- 
signs developed early in the history of incomplete block designs. The example 
exhibited in Display 10.2 is such a design. The balanced lattice square has t = k? 
treatments laid out in a k x k row—column design. One k x k array consists of one 
complete replication of the treatments with p= q=k. The balanced design re- 
quires r = (k + 1) replications for complete balance such that every treatment pair 
occurs together one time in the same row and one time in the same column 
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Display 10.2. A Nested 3 x 3 Row—Column Incomplete 
Block Design to Sample Insect Populations 


Day 
Time. (7) 42) 73) dime 1) Op eee 
(1) | lo (4) l 4 7 
(2) oe eee (5) Zz 5 8 
(3) 7 S209 (6) oa 6 9 
Plantation I Plantation II 


Day Day 
Time yay Time (10) “Gi 2) 
C7¥ 1 os (10) ] 9 5 
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(9) 5 be (12) 8 4 3 
Plantation III Plantation IV 


somewhere in the experiment. When k is an odd value, a replication value of 
r= 5(k +1) will provide semi-balance such that any pair of treatments occurs 
together one time in the same row or one time in the same column. 

Plans for lattice square designs with t = 9, 16, 25, 49, 64, 81, 121, and 169 
may be found in Cochran and Cox (1957) and Petersen (1985). Extensive discus- 
sions of the designs are found also in Kempthorne (1952) and Federer (1955). 


Analysis Outline for Nested Row—Column Designs 
The linear model for any resolvable nested row—column design is 
Yijlm = + Bm + Pim) + Ym) + Ti + Cijtm (10.2) 


pel ot Pee eee ee 2 eee 


where p is the general mean, 7; is the treatment effect, G,, is the replicate group 
effect, Pm) and Ym) are, respectively, the row and column nested in replicate 
group effect, and e;,1 is the random experimental error. 

Each of the treatments occurs in each replicate group, and the treatments are 
orthogonal to replicate groups. The rows and columns also are orthogonal to repli- 
cate groups. The sums of squares partitions for replications, rows within replica- 
tions, and columns within replications can be computed in the usual manner, as 
shown in the analysis of variance outline in Table 10.2. The treatments are not 
orthogonal to rows or columns of the replicate groups. The treatments must be 
adjusted for both row and column effects. 

The adjusted treatment totals required for least squares estimates of treatment 
effects and the adjusted treatment sum of squares are 
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Table 10.2 Intrablock analysis for a nested row—column incomplete block design 


Source of Degrees of Sum of 
Variation Freedom Squares 
Total 3 — 2 i - 2) 
Groups (replications) r—1 17, - yy 
Columns (unadjusted) r(q—1) Pst i. = YY 
within groups 
Rows (unadjusted) r(p — 1) G2 9 im Vind 
within groups 
Treatments (adjusted) t—1 iat ee 5“? 
r= 1g — 1) 
Error r(p— 1)(q—1)—(t-—1) _ By subtraction 
l l 
CO =e (10.3) 
P q 


where y;,. is the total for the 2th treatment, y is the grand mean for the experi- 
ment, A; is the sum of row totals that includes the 7th treatment, and C; is the sum 
of column totals that include the ith treatment. 

The least squares estimate of the treatment effect is 


Tg cas _ pat — Qi (10.4) 


Tap tg = 2) 
and the least squares estimate of the treatment mean is 1; = ji +7;, where 7 = 7 __ 
is the estimate of the general mean. | 
The efficiency factor for the nested row—column design is 


eee 1) 
put = 1) 


where for the balanced lattice square the efficiency factor is E = (k — 1)/(k + 1), 
with p = q =k and t = k?. For example, with k = 3 in Display 10.2, the efficiency 
factor is EK = 2/4 = .5S0. The balanced lattice square would have to reduce experi- 
mental error variance by 50% to estimate comparisons between treatment means as 
precisely as a complete block design of the same size. 


(10.5) 


10.3 Cyclic Designs Simplify Design Construction 


Extensive tables of design plans must be available to use the incomplete block de- 
signs discussed to this point. Setting up any design in the physical facility required 
for the experiment demands constant vigilance. Care must be taken to avoid mis- 
takes when treatments are assigned to experimental units and when data are 
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recorded. Complex incomplete block designs can add further complication to the 
process if there is a need for constant reference to the design layout. 

Cyclic designs offer a degree of simplicity in the construction of the design and 
implementation in the experimental process. Cyclic designs are generated from an 
initial block of treatments so it is not necessary to table complete plans of the 
experimental layout. Once the initial block is known the experimental plan can be 
generated, and the treatments can be randomly assigned to the numerical labels in 
the design. 


How to Construct a Cyclic Design 


The cyclic designation refers to the method used to construct the designs as well as 
to the association scheme among the treatments. The treatments are assigned to the 
blocks through a cyclic substitution of treatment labels from an initial generating 
block. 

The method requires that the treatments be labeled as (0,1,2,...,¢— 1). 
The cycle begins with an initial block of size k required by the experiment. Suc- 
ceeding blocks are obtained by adding 1 to each treatment label in the previous 
block. If a treatment label exceeds (t — 1) it is reduced by modulo t. The cycle con- 
tinues until it returns to the configuration of the initial block. 

Consider the design for six treatments in blocks of three units generated by cy- 
clic substitution in Display 10.3. The six treatments are labeled (0, 1,2,3,4,5) for 
the design. A cyclic design is constructed with the initial block (0, 1,3). The second 
block is constructed by adding 1 to each treatment label in the first block which 
gives (1,2,4) as the configuration for the second block. Add | to each treatment 
label in the second block to provide (2,3, 5) as the treatments for the third block. 


Design 10.3 A Cyclic Design for Six Treatments 
in Blocks of Three Units 


Block 


l (0, 1, 3) initial block 
2 (1, 2, 4) 
3 (2,3; 5) 
4 (3, 4, 0) 
5 (4,5, 1) 
6 (5, 0, 2) 


The transition from block 3 to block 4 in the cycle requires one label in block 4 
to be reduced by modulo t = 6. If 1 is added to each label in block 3 the configura- 
tion of block 4 will be (3, 4,6) where 5 + 1 = 6 exceeds (t — 1) = 5. Therefore, 6 
must be reduced modulo 6, which is 6 = 0(mod 6), and 0 replaces 6 in the block 4 
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configuration to give (3,4,0). A similar replacement occurs in the transition from 
block 5 to block 6. | 

A full set of b = 6 blocks is constructed in the complete cycle shown in the 
display. The numbers make one complete cycle through the treatment labels down 
each of the columns in the display. The treatment labels in the first position of the 
blocks cycle from 0 to 5 while the second position cycle begins with the label 1 and 
the third position cycle begins with the label 3. The design is complete with blocks 
1 through 6 since the seventh member of the cycle would be the initial block con- 
figuration, (0, 1, 3). 


Construct Row—Column Designs with the Cyclic Method 


Cyclic designs can be used for row—column designs if the experiment requires two 
blocking criteria. The only requirement is that the number of replications be some 
multiple of the block size, or r = 7k, where 7 is an integer value. The design in Dis- 
play 10.3 with t = 6 and r = k = 3 generated from the initial block (0, 1,3) can be 
used as a row—column design with six rows and three columns. 

The design is a column orthogonal design since each treatment appears in each 
of the columns. If the row and column integrity is maintained upon randomization, 
then a sum of squares for columns and rows is partitioned out of the total sum of 
squares. The treatments are adjusted for row blocks in the analysis. 


Tables to Construct Cyclic Designs 


A compact table of initial blocks required for relatively efficient cyclic designs with 
4 <t < 15 treatments, from John (1987), can be found in Appendix Table 10A.1. 
An additional table for 16 < t < 30 treatments also was provided by John (1987). 
Other tables can be found in John (1966). 

For a given block size k, the first k — 1 treatments in the initial block are all 
taken to be the same for any number of treatments t. The final treatment in the 
block is taken as the corresponding value in the column for t in Appendix Table 
10A.1. For example, with k = 3, r = 3, the initial block for t = 5 is (0, 1,2) and 
for t = 6 the initial block is (0, 1, 3). 

If a design with more replications is aoe (r > k), a second- or third-gen- 
erating block is given in the table. A set of blocks is constructed from each by cy- 
clic substitution. The design with t =6 treatments in blocks of size k =3 
generated with the initial block (0,1,3) had r = 3 replications. The design with 
r = 6 replications is constructed by adding the initial block (0,2, 1) found in Table 
10A.1 on the line with k = 3 and r = 6. The resulting design has b = 12 blocks, 
half of them generated from each of the initial blocks as shown in Display 10.4. 
The design is partially balanced with A; = 3 and A» =2 for the two associate 
classes. 
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Display 10.4 A Cyclic Design Generated from 
Two Initial Blocks 


Block 
(0, 1, 3) initial block (0, 2, 1) initial block 


(1, 2, 4) (ioe 2) 
EPs eg (2, 4, 3) 
(3, 4, 0) (3, 5, 4) 
(4,5, 1) (4, 0, 5) 
(560352) (53:40) 


a Designs for Versatile Resolvable Designs from Cyclic Construction 


The a designs are resolvable designs developed by Patterson and Williams (1976) 
in response to the requirements for statutory field trials of agricultural field crop 
varieties in the United Kingdom. The number of varieties and replications were 
fixed by statutory requirements and were not under the control of the experi- 
menters. The large number of varieties necessitated incomplete block designs, and 
resolvable designs were required for proper management in the field. Existing re- 
solvable designs did not always accommodate the number of varieties or block 
sizes required by the trials. 


a Design Features 


The designs have no limitation on block size except for the constraint that the 
number of treatments, t, must be a multiple of block size k, so that t = sk, to have 
a resolvable design with equal block sizes. Tabled plans of the complete designs are 
unnecessary since they can be generated by cyclic substitution from an initial array 
of numbers much like the cyclic designs. 

The development of the a designs greatly increased the number and flexibility 
of resolvable designs available for experimental trials. The designs originally 
developed by Patterson and Williams (1976) for the statutory trials included 
designs with r = 2, 3, or 4 replications with t < 100 treatments and block sizes 
a Se S10. 

The designs are partially balanced with two or three associate classes. The de- 
signs with two associate classes having A; = 0 and A2 = | are denoted a(0, 1) de- 
signs, and designs with three associate classes having A; = 0, Az = 1, and A3 = 2 
are denoted a(0, 1,2) designs. Most of the designs considered for the variety trials 
were a(0, 1) designs. Because of the manner in which the designs are constructed 
neither balanced designs nor a(1, 2) designs exist. 

The upper bound for the efficiency of the a designs provided by Patterson and 
Williams (1976) is 
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a) 


ee ee ee 


(10.6) 


How to Construct a Designs 


The construction of a design for t = sk treatments in r replications begins with 
ak x r array designated as the generating a array. Each column of the generating 
q@ array is utilized to construct s — 1 additional columns by cyclic substitution. The 
new k x rs array is denoted as the intermediate a* array. 

Consider a 4 x 3 generating a array for t = 12 treatments, k = 4 units per 
block, r = 3 replications, and s = 3 blocks per replication. 


Generating a Arrays 


Column 
a 
0 <0 -~D 
Of) 2 
a2 4 
O-si 4 


Each column is used to generate s — 1 = 2 additional columns by cyclic sub- 
stitution with mod(s) = mod(3) to derive the intermediate a* array. The result will 
be r = 3 arrays of size k x s. In this example, there will be three 4 x 3 arrays. The 
intermediate a* arrays generated from each of the three columns of the a array are 


Intermediate a* Arrays 


Array Generated From 
Column I Column 2 Column 3 
ae ee Zee See. 0 
Z 
l 
1 


— — 
NN WN 
NON Oo = 
oo — N 


0 2 
0 ] 
0 0 


Finally, treatment labels are obtained by adding s = 3 to each element of the 
second row of the intermediate a* array, 2s = 6 to each element of the third row, 
and 3s = 9 to the fourth row. The columns of the arrays are the blocks of the de- 
sign, and each set of s columns generated from a generating a array column consti- 
tutes a complete replication. The resulting a(0,1,2) design is shown in Display 
b0:3: 

The treatment labels for the design are (0,1,2,...,¢— 1), and the actual 
treatments are randomly allocated to the labels. In addition, randomization proceeds 
with a random allocation of the design blocks to the actual blocks within the 
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Design 10.5 Ana Design for 12 Treatments 
in Three Replication Groups 


Replicate 
Block 


replicate groups and a random allocation of the treatments to the actual 
experimental units within the physical blocks. 

A table of 11 basic a arrays reproduced from Patterson, Williams, and Hunter 
(1978) is shown in Appendix Table 10A.2. A total of 147 a designs can be gener- 
ated from the 11 arrays. The arrays can be used to generate designs for treatment 
numbers 20 < t < 100 with r = 2, 3, or 4 replications and the usual constraint that 
t = sk. The arrays will produce designs with block sizes k > 4 with the condition 
k<s. 

Consider the first array listed in Appendix Table 10A.2 for s = k = 5. The ar- 
ray has k = 5 rows and r = 4 columns. 


So co 2 © & 
kh WN — © 
me NO WwW kh © 
We BN © 


This particular array can be used to generate an a design with r = 2 replications by 
utilizing only the first two columns of the array. Likewise, designs for three or four 
replications can be generated by using the first three or all four columns, respec- 
tively. A design with block size k = 5 is obtained by utilizing all five rows of the 
array. A design with block size k = 4 is constructed by utilizing the first four rows 
of the array. With s = 5 a design may then be constructed for ¢ = 25 treatments in 
blocks of k = 5 units with r = 2, 3, or 4 replications. Also with s = 5S a design 
may be constructed for t = 20 treatments in blocks of k = 4 units with r = 2, 3, or 
4 replications. 


Latinized Resolvable Incomplete Block Designs 


The resolvable designs discussed to this point have a nested blocking structure. The 
incomplete blocks are randomized independently within each replicate whether the 
designs have a single blocking criterion or two blocking criteria with rows and 
columns. 
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In some instances, the blocks of one replicate may have a relationship to blocks 


in another replicate. These relationships could exist in experiments wherein the 
blocks in different replicates are contiguous in space, as illustrated in Display 10.6. 


Display 10.6 Illustration of Contiguous 
Blocks in the Replicates 


Replicate I 


Replicate II 


Replicate X 


If the spatial layout is that shown in Display 10.6, then any single block—e.g., 
Block 1—in each replicate occurs in the same spatial line as Block 1 in each of the 
other replicates. The result is the occurrence of a “long” block in the vertical direc- 
tion of the layout in Display 10.6. 

If blocks are randomized separately in each replicate, it is entirely possible for 
all replications of one or more of the treatments to occur in the same “long” block. — 
This could create an undesirable circumstance in the case of unforeseen 
disturbances which negated the use of the “long” block. To avoid this possibility 
Harshbarger and Davis (1952) introduced Latinized rectangular lattice designs. The 
rectangular lattice was structured such that each treatment occurred only once in 
each long block. 

Williams (1986) presented results on more general designs in which treatments 
do not occur more than one time in the long blocks and extended the Latinized de- 
signs to include row—column blocking. The a design can be used to generate the 
more general Latinized designs as illustrated by John and Williams (1995) with a 
Latinized row—column design shown in Display 10.7. Notice that treatments appear 
no more than one time in each of the long blocks (columns). John and Williams 
(1995) provide a thorough discussion of cyclic and resolvable designs. 
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2 Display 10.7 A Latinized Row—Column Design 


Block 
2 
Replicate I 


5 
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Replicate II 


Replicate Ill 


10.4 Choosing Incomplete Block Designs 


A successful experiment will include the correct treatment design to address speci- 
fic research questions and provide precise estimates of the treatment means and 
contrasts of interest to the investigator. The incomplete block design often is neces- 
sary to obtain the best possible conditions for comparisons among the treatments 
with as homogeneous groups of experimental units as possible. 

The investigator’s knowledge of the experimental conditions and material is 
indispensable in constructing suitable block sizes and composition. Each experi- 
ment presents its own challenges and conditions, and no set rules provide the cor- 
rect design for every experiment. A knowledgeable investigator can craft a 
successful experiment with the intelligent application of sound design principles to 
the experimental material. 

The guiding principle in selecting or constructing a design for a specific re- 
search problem is to make every attempt to choose a design that accommodates the 
research problem. In most cases, appropriate designs are available to accommodate 
the requirements of the experiments because such a large number of published 
plans for balanced incomplete block designs are available in the literature as a re- 
sult of the research on design construction. 

The systematic methods to construct balanced incomplete block designs re- 
quire mathematics beyond the scope of this book. Discussions of research on de- 
sign existence and construction can be found in John (1971), John (1987), and John 
and Williams (1995). 
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Where to Find Design Plans 


Computer programs provide the most convenient method to develop an incomplete 
block design plan; CycDesigN (Whitaker, Williams, & John, 1998) and ALPHA+ 
(Williams & Talbot, 1993) are two examples of programs that generate designs dis- 
cussed in Chapters 9 and 10. 

Tabled plans for small experiments with t < 11 treatments are found in Appen- 
dix 9A.1. Plans for other numbers of treatments are given in Cochran and Cox 
(1957), Cox (1958), Fisher and Yates (1963), Box, Hunter, and Hunter (1978), and 
Mason, Gunst, and Hess (1989). Plans for balanced and partially balanced lattice, 
rectangular lattice, and lattice square designs may be found in Cochran and Cox 
(1957) and Petersen (1985). Plans for designs with row—column blocking as row 
orthogonal incomplete Latin squares may be found in Youden (1940), Cochran and 
Cox (1957), Cox (1958), and Petersen (1985). Some of the plans for row orthog- 
onal designs are given in Appendix 9A.2. A large number of partially balanced 
incomplete block designs can be found in Clatworthy (1973) and Bose, Clatworthy, 
and Shrikande (1954). 


An Informal Approach to Design Construction 


In the cases where a standard design does not accommodate the research problem, 
it may be necessary to find an innovative design solution to avoid alterations in the 
treatment design. The need for occasional innovation argues for a more informal 
approach to experiment design (Mead, 1988). The primary reason for advocating 
the informal design is to avoid the trap of changing the treatment design or the re- 
search problem to fit some particular tabled design. Changing a research problem to 
fit the experiment design would be unwise under any circumstances. However, it is 
advantageous to understand some of the more formal properties of experiment de- 
sign to fully appreciate innovative approaches and avoid possible pitfalls of the in- 
formal approach. The investigator must be quite familiar with the experimental 
conditions and material to construct a design appropriate to the study. 

The construction can be quite informal if great care is taken in the placement 
of treatments with experimental units to avoid mishaps. Particular attention must be 
paid to the amount of information available for various treatment comparisons as 
the design is constructed. The relative differences in variances of the pertinent treat- 
ment comparisons must be evaluated prior to actual experimentation. Such an ap- 
proach is advocated by Mead (1988). 


The Efficiency Factor Increases with Block Size 


We expect the experimental error variance to decrease as the block size decreases. 
The decrease in a” with smaller block sizes depends on the success we have in 
placing the experimental units in the correct blocking arrangement. However, a 
close inspection of the efficiency factor suggests taking as large a block size as 
possible. The efficiency measure for a balanced incomplete block design can be 
expressed as 
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E=>=—- (1-5) (10.7) 


The value of & increases with increasing block size k. A cursory scan of the list of 
balanced incomplete block designs in Appendix 9A.1 will confirm this relationship. 
A similar relationship is true for the partially balanced incomplete block designs. 


Let Resources Dictate Block Size 


The available resources dictate the choice of block size. In some experiments the 
block size is constrained to the available instruments for the experiment as was the 
case for the methyl glucoside vinylation experiment in Example 9.2. The investiga- 
tor only had three chambers available to test five pressure treatments. With 
t=5,r=6,k =3, and A =3, the efficiency factor for the balanced incomplete 
block design was E = 3(5)/6(3) = 0.83. Equivalent precision to a complete block 
design with five pressure chambers would require a 17% reduction in o” for the 
incomplete block design. However, the only choice available to the investigator 
was the incomplete block design with three treatments per block. 

Some experimental conditions allow greater flexibility in the choice of block 
sizes. Variety yield trials are placed in field plots on experimental farmland. In 
many of these situations the block sizes can be of any convenient size. The guiding 
principle is to have a sensible block size that provides as much homogeneity among 
the plots within blocks as is necessary for precise comparisons among the varieties. 
In this case the block size should be as large as possible while maintaining the re- 
quired precision for variety comparisons. 


Partially Balanced Designs Increase Efficient Use of Resources 


Partially balanced designs can be used in place of balanced designs to utilize 
experimental resources more efficiently. Consider the partially balanced design in 
Display 9.3. The block sizes in the experiment were constrained to four experi- 
mental units. A balanced design would have required ten replications of the six 
treatments and 15 blocks. The 3 blocks of the partially balanced design provide two 
replications. If more replications are required the partially balanced design can be 
repeated. Unless ten replications are required the partially balanced design requires 
fewer resources than the balanced design. 

The investigator had to make some sacrifice to have the smaller experiment. 
The partially balanced design provides more precision for some comparisons than 
others because not all the treatment pairs occur in the same number of blocks. 
However, with a prudent choice of design it is still possible to have a partially bal- 
anced design that provides near balance in all comparisons. 

The partially balanced design in Display 9.3 had some treatment pairs occur- 
ring in A; = 2 blocks and other treatment pairs occurring in Ay = 1| block. The effi- 
ciency for the comparisons among first and second associates can be shown to be 
Ff, = 1.00 and FE» = 0.86, respectively, with an average efficiency of £ = 0.88 
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(Clatworthy, 1973). There is near equal precision for both sets of associates, and 
only a small amount of precision is sacrificed with the partially balanced design. 


a Designs Relieved Constraints for Large Experiments 


The incomplete block designs originated with experimental trials that included rela- 
tively large numbers of treatments such as variety trials. The primary objectives of 
the incomplete block designs were to reduce the block sizes to reduce experimental 
error variance and to have experiments that could be managed on a replicate group 
basis. In agricultural field trials the block sizes were somewhat flexible as long as a 
reduction in block size managed to reduce the error variance. The original square 
and rectangular lattice designs provided a degree of relief for these situations. The 
variety numbers for these trials were still quite restrictive even though it often was 
possible for the investigators to adjust the number of included varieties without dire 
consequences. 

The a designs relieved the constraint on the number of treatments for resolv- 
able incomplete block designs to a considerable extent. They do still restrict the 
number of treatments to be a product of the number of units per block and the num- 
ber of blocks in a replication group. However, it is now possible to have a resolv- 
able design for a greater variety of treatment, block, and replication sizes. In 
addition, the a designs can be generated from a relatively simple set of tables. 


Factorial Treatment Designs Require Special Attention 


Structured treatment designs such as factorials require special attention when con- 
structing incomplete block designs. The construction of incomplete block designs 
with factorial effects confounded with blocks will be considered in Chapter 11. 
These designs are particularly useful for the 2” series of factorials that have many 
factors because no great sacrifice results from completely confounding large order 
interactions with blocks. 


Cyclic Designs Are Helpful with Unstructured Treatment Designs 


Balanced designs or resolvable designs can in some circumstances strain available 
resources because of the required number of replications. The more flexible partial- 
ly balanced designs can be used in place of the balanced design with little loss 
in precision. However, the tables of partially balanced designs may not be readily 
available because many of them appear in older publications. Many of the tabled 
designs found in these publications can be constructed by cyclic substitution. Cy- 
clic designs may be one of the best alternatives for producing an efficient design. 
They provide designs of many different sizes and are constructed easily from one 
or two initial blocks of treatments. The tables of initial blocks for cyclic designs 
appear in somewhat more recent publications than the other classes of designs. 
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1. A horticulturalist conducted a field test of t= 8 broccoli varieties with a resolvable balanced 
incomplete block design in a field experiment. The design had k = 4 plots per block and s = 2 
blocks per replication. There were r = 7 replicate groups in the design. The data are pounds of 
broccoli harvested per plot. (R = replication, B = block within replication, 7 = variety) 
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a. Write a linear model for this experiment, describe the terms, and conduct the intrablock 
analysis. 

b. Compute the standard error estimate of the difference between least squares estimates of two 
treatment means. 

c. What is the efficiency factor for this design? 

d. Write asummary of your analysis results and evaluation of the design. 


2. An agronomist conducted an alfalfa variety trial in a quadruple lattice design. There were 25 varie- 

ties grown in four replicate groups. Each replicate group consisted of five blocks of five varieties in 

a square lattice. The yield data that follow are pounds of alfalfa hay harvested per plot. The variety 
numbers appear in parentheses. 


2 18.6 
3 20.8 
4 21.0 
5 19.6 


eG 


(2) 
(8) 
(15) 
(17) 
(24) 


20.4 
18.3 
19,5 
19.4 
19.0 


(1) 
(6) 
(11) 
(16) 
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Replicate I 
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Replicate II 


] 19.4 
2 18.6 
3 16.6 
4 21.8 
5 21,2 
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2 19.5 

3 13.5 

4 22.6 

5 20.0 
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Source: Dr. M. Ottman, Department of Plant Sciences, University of Arizona. 


(11) 
(17) 
(18) 

(4) 
(15) 


(1) 
(21) 
(9) 
(10) 
(24) 


(12) 
(24) 
(25) 
(15) 

(8) 


19.8 
20.8 
19.8 
20.6 
19.6 
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19.1] 
23.1 
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(21) 
(2) 
(3) 

(14) 

(20) 


(13) 
(14) 
(22) 
(23) 
(18) 
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(4) 
(10) 
(13) 
(19) 
(21) 


(6) 
(12) 
(23) 
(19) 
(25) 


Replicate III 


19.3 
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(3) 
(9) 
(12) 
(18) 
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(1) 
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(25) 
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13.6 
19.3 
18.9 
20.9 
16.9 


EXERCISES 357 


(5) 
(7) 
(14) 
(20) 
(22) 


(16) 
(22) 
(13) 
(24) 

(5) 


(19) 
(8) 
(16) 
(11) 
(6) 


(9) 
(2) 
(14) 
(4) 
(19) 


a. Write a linear model for the experiment, explain the terms, and conduct the intrablock analysis 
for a simple, triple, or quadruple lattice design (instructor discretion). 
b. Compute the standard error estimates of the difference between intrablock estimates of two 
variety means that are first associates and between those that are second Sisson tase Also com- 
pute an average standard error. 
c. What is the efficiency factor for the simple or triple lattice design? Interpret the efficiency 


factor. 


_d. Write a summary of your analysis results and evaluation of the design. 


An agronomist conducted a wheat variety test in a balanced lattice square design. There were t = 9 
varieties in a3 x 3 balanced lattice square in r = 4 replicate groups. The wheat yields are shown in 
the table that follows in the row—column arrangements for each replicate group. Variety numbers 


appear in parentheses. 
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Replicate I Replicate II 
53.516) (83.2 (4)" SAFO) 53.7 (4) 53.6(2) 57.8 (9) 
53.443) (58.6 (2). 53:9 Q) 545 (3) <52.847) 53.35) 
57:2 9). 433.0 (F=S13 48) 48.9(8) 53.5(6) 56.7 (1) 

Replicate III Replicate IV 
49.4(8) 54.7(4) 55.6 (3) 34.007). 37:20) 53.24) 
54.5(6) 54.2(2) 54.4(7) 56.9(9) 54.8(3) 55.4 (6) 
5971) (35.7 (854.9) 48.9(8) 53.4(2) 55.9 (5) 


a. Write a linear model for this experiment, describe the terms, and conduct the intrablock analy- 


sis for this experiment. 

b. Compute the standard error estimate of the difference between least squares estimates of two 
treatment means. 

c. What is the efficiency factor for this design? 

d. Write a summary of your analysis results and evaluation of the design. 


4. A variety trial was conducted in an a(0,1,2) resolvable design. There were t = 18 varieties in 
r =4 replicate groups. There were s = 3 blocks of k = 6 varieties in each replicate. The yield 
data follow with the variety numbers in parentheses. Varieties 1 and 5 were control varieties. 


Block Replicate I 
l 88.2 (5). 82.5 (10): 843.448) 87.0 (6) 845 (12) 3889 (8) 
2 $2.4, (1) 82.9 (14) - 83.1 — 43) «84.7 -G3) 83:3 -@o) 89.0 (4) 
3 93.1s a2) -82:7 7 C1) 288.9 > G7) ebh6 eS) 84.1 2308) 875 7) 
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$5.45 (4) 730 (hl) 84.2 ©) 605 114) 979.6 410): 86.0 ©) 


2 87.9 (8) a 864 2(0)\ 79 Ae B pal Ch ela): 89:3 > NSO BLS: GG) 
3 $2.4 11). 885 2 2). 87.02.4127) 22 4-017 854 (05). 7a 266) 
Block Replicate III 


L | 83.66) 19.45 (1) 81.3 4G) 8059-9) 809 « (8). 19.3250) 
2 | 804. (7) 882 - (5) 823 (14) 880 G2) 2900.r @pr88 me 2716) 
ii + lode! (18% 298482415)" STO SOR VIB! 9) 7015 {TY BRB 8d 16) 


Block Replicate IV 
l 80:5 (16) Til €11) 844: (277-94 (6) 829 414) 828 - ci) 
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Source: P. Seeger, Department of Statistics, The Swedish University of Agricultural Sciences. 


a. Identify the first, second, and third associates of variety 1, where Ay = 0, Ap = 1, and A3 = 2. 
How many of each are there, and which varieties are in each group of associates? 
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Write a linear model for the experiment, explain the terms, and conduct the intrablock analysis 
to obtain intrablock estimates of the variety means and their standard error estimates. 

The a@ designs have a multiplicity of standard errors for the differences between pairs of esti- 
mated treatment means. If your program is capable, compute the standard error estimates of 
the difference between intrablock estimates of the control variety means, varieties 1 and 5, and 
the other varieties; the average standard error for the first, second, and third associates; and an 
overall average standard error. 

Compute the standard error estimates of the difference between the variety means; an average 
standard error of the difference for first, second, and third associates; and an overall average 
standard error. 

Write a summary of your analysis results and evaluation of the design. 


A human factors study is to be conducted on speed of perception relative to object shape in statisti- 
cal graphs. Eight shapes will be used as treatments in the study. It is thought that a subject should 
evaluate no more than four shapes in any one sitting. Researchers have decided to use an incom- 
plete block design with subjects as blocks. 


a. 


b. 


Construct a cyclic design with r = 4 replications of t = 8 object shape treatments. Use human 
subjects as blocks with k = 4 treatments per subject. 

Randomize the blocks of treatment labels to subjects, and randomize the actual object shape 
treatments to the treatment labels. 

Suppose the order of presentation to the subjects could be an important source of variation. 
Construct the design as a row—column design, and randomize accordingly. 

Suppose eight replications were required for the study; construct the cyclic design with eight 
replications. | 


Generate an a design for three replications of 35 treatments in blocks of five plots. 


a. 


b. 


Show the generating a array, the intermediate a arrays, and the final alpha design with treat- 
ment labels. 

Randomize the incomplete blocks of treatment labels to actual blocks, and randomize the treat- 
ment labels to the units within a block. 

Randomly assign the actual treatments to the treatment labels. 
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10A.1 Appendix: Plans for Cyclic Designs 


Table 10A.1 Initial blocks to generate efficient cyclic partially. balanced 
incomplete block designs for 4 < ¢ < 15,r < 10 
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Reproduced with permission from J. A. John (1981), “Efficient Cyclic Designs.” Journal of the Royal Statistical Society B, 43, 
76-80. 
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igns 


Generating Arrays for a Desi 


10A.2 Appendix 


Table 10A.2 Generating Arrays (k x r) for a Designs 


g=e_= 10 


S= 13.2 7 


s=12,k=8 


s=11,k=9 
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1] 


10 
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1] 


IZ 
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11 


10 
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14 
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10 


11 


10 
14 


12 


1] 


Reproduced with permission of Cambridge University Press from H. D. Patterson, E. R. Williams, and 


11 


E. A. Hunter (1978), Block designs for variety trials. Journal of Agricultural Science 90, Table 2, p. 


399. 


11 Incomplete Block Designs: 


Factorial Treatment Designs 


The versatility of factorial treatment designs is explored in this chapter and the next 
with emphasis on 2” and 3” factorials. The effects and notations specific to factori- 
al treatments are used to develop methods to construct incomplete block designs 
for factorial treatment designs in this chapter. 


11.1 Taking Greater Advantage of Factorial 
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Treatment Designs 


The factorial treatment design was discussed in Chapters 6 and 7 as a means for 
investigating the effects of several treatment factors in the same experiment. The 
primary advantage of the factorial arrangement resides in the ability to determine 
whether the factors act independently or interact with one another as they affect the 
experimental units. 

The 2” and 3” factorial treatment designs are of great practical importance and 
widely used in research studies. The 2” factorials have n factors at two levels, and 
the 3” factorials have n factors at three levels. As the number of factors increases, 
the number of treatment combinations rapidly escalates so that incomplete block 
designs are required to control experimental error. The factorial arrangement in 
each of these designs may be exploited to provide effective incomplete block de- 
signs for the investigation of factor effects and to facilitate the analysis of factor 
effects. 
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11.2 2" Factorials to Evaluate Many Factors 


The requirements of a research program may demand the investigation of many 
factors and their interrelationships as they affect the outcome of a process. Consider 
a large continuous production process in a chemical manufacturing plant that may 
not have as high a percent yield of the final product as projected by the plant con- 
struction specifications. The chemical yield of the final product is affected by fac- 
tors at each of several separate reaction steps in the process. The problem is to 
identify factors that affect the final chemical yield and to determine the level of © 
those factors that optimize the yield. The factors can include such items as the con- 
centration of catalysts, the concentrations of reactants in solvents, and the ratio of 
one reactant to another, as well as temperature, pressure, agitation rates, and resi- 
dence time in the reaction chambers. 

Clearly, the engineers who are attempting to improve the plant performance 
have a formidable task before them. They have many factors to investigate with the 
potential that only a few are of major importance. They must screen the factors to 
determine the ones that warrant more detailed study, and at the same time they must 
control the cost of the experiments. 

The 2” factorial with many factors, each at a “low” level and a “high” level, 
can be used to detect the important factors in the process with a minimum of 
experimental units. Major trends can be detected with factors at two levels to iden- 
tify potentially important factors. Consequently, 2” factorials are often used at the 
early stage of experimentation to detect potential candidate factors for more de- 
tailed investigation. The following example with three factors illustrates the fea- 
tures of 2” factorial treatment designs. 


Example 11.1. Truck Leaf Spring Manufacture 


An experiment described by Pignatiello and Ramberg (1985) was designed to 
investigate the effects of factors on a manufacturing process for leaf springs 
used on trucks. The assembled leaf spring was passed through a high- 
temperature furnace. Afterward, it was put into a forming machine to induce 
curvature in the spring by holding the spring in a high-pressure press for a 
short length of time. 

The factorial treatment design for the experiment consisted of three fac- 
tors, each at two levels. They were furnace temperature (A), heating time in 
the furnace (6), and transfer time between the furnace and the forming 
machine (C). The eight treatment combinations for the three factors in a 2° 
factorial are shown in Table 11.1, along with the observed measure of prod- 
uct quality (y) for one replication of each treatment. 


New Treatment Labels 


In general, the low level of a quantitative factor is denoted by a “0” and the high 
level by a “1.” Equivalently, the two categories of a qualitative factor can be coded 
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Table 11.1. Truck leaf spring quality observations from a 2° factorial experiment 


A B C 
Furnace Heating Transfer 
Temperature (°F) Time (sec) Time (sec) y 
1840 23 10 Ey 
1880 23 10 a 
1840 ZS 10 28 
1880 Zo 10 31 
1840 23 12 48 
1880 23 12 39 
1840 2) 12 28 
1880 Z. 12 29 


as “0” and “1.” Another useful notation for treatment combinations in 2” factorials 
is illustrated with a 2° factorial in Display 11.1. 

The uppercase letters A, B, and C represent factors. The “Treatment” label 
uses corresponding lowercase letters a,b,and c. The lowercase letter is present if 
the factor is at level 1. The lowercase letter is absent if the factor is at level 0. The 
Treatment label is “(1)” if all factors are at level 0. The correspondence between the 
Treatment labels and the (0, 1) designations for factor levels are shown in Display 
11.1, along with the actual levels of each factor in the truck spring experiment. 


Evaluating the 2” Factorial Effects 


The effect of a factor with 2” factorials corresponds to a change in the response 
from the low level to the high level of the factor. The simple effects, main effects, 
and interaction effects for factorials were discussed in Chapter 6. For illustration, 


Display 11.1 Treatment Labels for the Truck Spring Experiment 


Furnace Heating Transfer 
Treatment BC. Temperature (°F) Time (sec) Time (sec) y 
0320 1840 23 10 32 
Gr 6 1880 23 ee 
baw 1840 aS 10 28 
1 0 1880 25 10 31 
Gy 1840 23 12 48 
— 1880 a3 12 39 
fo4 1840 25 Z 28 
4 1880 25 12 oo 
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consider the observations from the truck spring experiment. The simple effects of 
Furnace Temperature as it changes from 1840°to 1880°F with factors Heating 
Time and Transfer Time held constant are shown in Table 11.2. 


Table 11.2 Simple effects of Furnace Temperature at constant levels of Heating 
Time and Transfer Time for the truck spring experiment 


Heating Transfer Furnace Temperature (A) 

Time (B) Time (C) 1840 1880 Simple Effect B & 
ie 10 32 Te eg 3g 3 0 0 
25 10 28 31. - 31-26 =3 l 0 
23 swe 48 39 «639-48 = —-9 0 ] 
2D 12 28 oe. ge 2p = I l ] 


The main effect of a factor is the average effect of moving from its 0 level to 
its 1 level. Thus, the main effect of Furnace Temperature is the average of its sim- 
ple effects 


A=73+34+(-9+1]= -05 (11.1) 


Equivalent calculations produce the main effects B= — 9.5 andC = 4:5. 

Two factors, say A and B, interact if the effect of A is different at the two 
levels of B. When Heating Time is 23 seconds, 6 = 0 and the effect of Furnace 
Temperature is 


1 
(A| B=0)==B3+(-9] = -3 


However, when Heating Time is 25 seconds, B = 1 and the effect of Furnace Tem- 
perature is 


(A|B=1)=5G41)=2 


If the furnace temperature is increased from 1840° to 1880° F the product quality is 
reduced three units if the heating time is 23 seconds. However, product quality is 
increased by two units if the heating time is 25 seconds. The response to the fur- 
nace temperature is different for the two heating times, implying the potential for 
an interaction between Furnace Temperature and Heating Time. 

The interaction between two factors, say A and B, is defined as one-half the 
difference between the effect of A at B = 1 and B = 0. The estimate of interaction 
between Furnace Temperature and Heating Time is then 


AB =; {(4|B=1)-(A|B=0)} => 2-(-3)] =25 
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If the roles of factors A and B are interchanged the resulting interaction value is 
unchanged. Calculations for the other two-factor interactions give AC = — 3.5 
and BC = — 5.5. | 

The three-factor interaction ABC arises if a two-factor interaction, say AB, is 
different for C =0 and C = 1. The AB interaction is one-half the difference 
between the effect of A at B= 1 and B=0. When transfer time is 10 seconds, 
C’ = 0 and the estimate of interaction between Furnace Temperature and Heating 
Time is 


(AB| C =0)= 53-3) =0 


The response to A is three units regardless of the level for B. The zero value for 
AB interaction when C' = 0 is represented graphically as two parallel response 
lines in Figure 11.1a. 


Heating time 
Heating time 


(a) (b) 


Figure 11.1 Graphic representation of ABC interaction depicted by different AB 
interaction atC = 0 andC = 1 


When transfer time is 12 seconds, C' = 1 and the interaction between Furnace 
Temperature and Heating Time is 


| 
(48 iO mALSeSiL sn Ga Die 
The response to A is different for 6B = 0 and B = 1. The positive measure of AB 


interaction when C’' = | is represented as two different response lines in Figure 
tiv; 
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The interaction between Furnace Temperature and Heating Time differs with 
the transfer time. The estimate of AV interaction is 0 when transfer time is 10 
seconds, and the estimate is 5 when the transfer time is 12 seconds. 

One-half the difference between these two evaluations of the AB interaction is 
the three-factor interaction. The three-factor interaction estimate is 


1 
ABC = 5 {(AB|C =1)-(AB|C =0)} = 35-0) =2.5 


The existence of a three-factor interaction translates into the two different graphic 
representations of the AB interaction in Figure 11.1. The same value for ABC 
interaction is attained if the AC’ interaction is evaluated at B = 0 and B = 1 or the 
BC interaction is evaluated at A = 0 and A = 1. 


A Table of Contrasts to Summarize the Effects 


The effects for 2” factorials can be defined as contrasts with coefficients of + 1 or 
— | for each of the treatment combinations. A table of + and — signs determines 
the correct coefficient for any treatment combination in the contrast, and the table 
provides a systematic method for calculating factorial effects. 

The full complement of + and — signs required for the contrasts in the 2° 
factorial for the truck spring experiment is shown in Table 11.3. The table of + 
and — signs is generated in the following manner: 


Table 11.3. Coefficients for contrasts in a 2° factorial treatment design 


Factorial Effects 

Treatment I A B 3 AB AC BO ABC y 
(1) + = = a as ‘iz + ss 32 
a + aig a = = = + =f = 
b i = es ae = a a — 28 
ab + + za = - <= me = 31 
Cc + ae = - aa = = ab 48 
ac + — si + = a es = 39 
be + = + + =< = + ee 28 
abc + + ae = a3 ~~ =~ + Whe 
Divisor 8 4 4 4 4 4 “ os 

Effect pte Se ae eS eS 2 

SS 5: 1605 405 125 2a. OS 12S 


e The column denoted J, containing all + signs, is used to estimate the 
grand mean with a divisor of 2”. 


e The next three columns, labeled with factors A, B, and C’, have the + and 
— signs in standard order. The standard order has factor levels arranged 
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such that column A has successive pairs of — and + signs. Column 5B 
has pairs of — signs followed by pairs of + signs. Column C’ has four 
— signs followed by four + signs. In general the kth column has 2*~! of 
the — signs followed by an equal number of + signs. 


e The coefficients for any two-factor interaction are obtained as a product of 
the columns of coefficients for the corresponding main effects. For exam- 
ple, the coefficients for the AB column are the products of the correspond- 
ing elements in the A column and the B column. 


e The coefficients for the three-way interaction ABC are obtained from the 
product of the coefficients for any set of columns whose symbol product is 
equal to ABC. The coefficients for ABC can be obtained from any of the 
symbolic products A x Bx C, AB x C, AC x B, or BC x A. 


Column J often is referred to as the /dentity column. Each column, except J, has an 
equal number of + and — signs. 

Estimates of effects in Table 11.3 are computed by multiplying the correspond- 
ing column sign by the response y in each row of the table, summing the products, 
and dividing the sum by the appropriate divisor. The calculation of the main effect 
for Furnace Temperature in Equation (11.1) is the result of using the + and — 
coefficients in column A in Table 11.3 for the corresponding value of the response 
y and dividing by 4, or 


] 
Assi Cs 32 dee Peas 39 Seto) = =o 


Similarly, utilizing the + and — codes under column B the main effect of Heat- 
ing Time is 


1 
B= -(—32 35 1 — 48 — 39 4 95 


The AB interaction effect is calculated using the + and — signs under the AB 
column to give 


ae 
AB = gor ae + 48 — 39 — 28+ 29) = 2.5 
In general, the contrast among treatment means for any main effect or interac- 
tion 1s 
ie = > kia (11.2) 


where the coefficients for the contrasts are k; = + 1. Following the convention for 
calculating contrasts, standard errors, and sums of squares in Chapter 3, the esti- 
mate of the effect for any contrast among treatment means in a complete 2” fac- 
torial can be expressed as 
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| 
AB... = =—(lap..) (11.3) 


The standard error estimate for an effect estimate is 


— 11.4 
SAB. NY Son (11.4) 


where r is the number of replications for each treatment. The 1 degree of freedom 
sum of squares for the effects shown at the bottom of Table 11.3 is obtained with 


SG Ag. ye = (lap... (11.5) 


For the example in Table 11.3, r = 1 because each treatment combination only 
occurs one time. The 1 degree of freedom sum of squares for A is 


l 
SSA = gy = 0.5 
while those for B and AB are 


1 1 
SSB= 3-38) = 180.5 and SS(AB)= g (10) = 12.5 


11.3. Incomplete Block Designs for 2” Factorials 


The use of incomplete block designs to reduce experimental error variance was 
introduced in Chapter 9. The performance of a complete replication for the 2” fac- 
torials with many factors may not be possible in a single complete block. If there is 
insufficient raw material in a manufactured batch to accommodate all of the treat- 
ments, each batch of raw materials can be used as an incomplete block. If experi- 
mental error is too large with complete block designs in agricultural field 
experiments, the variation among field plots can be controlled in the experiment 
with reduced block sizes for more homogeneous groups of experimental plots. 
Blocks of reduced size for 2” factorials can be devised by exploiting the construc- 
tion of effect contrasts in the 2” factorials. 


Sacrifice Treatment Information to Increase Precision 


Incomplete block designs for 2” factorials are constructed such that one or more 
treatment contrasts are identical to block contrasts. The treatment effect is said to be 
completely confounded with blocks. The confounded treatment effect is indistin- 
guishable from the effect of the blocks with which it is confounded. 
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Ordinarily the highest order interaction effect in a 2” factorial is chosen to be 
confounded with blocks. Main effects, two-factor interactions, and other lower 
order interactions, in the case of experiments with many factors, are those effects of 
most interest. By confounding the highest order interaction the other effects are 
estimated without penalty. 

The construction of an incomplete block design is illustrated with a 2° fac- 
torial. A complete block design requires eight experimental units per block. Half of 
the treatments have a + coefficient and half have a — coefficient for every effect. 
A resolvable incomplete block design with two blocks of four units each per repli- 
cation can be constructed using the contrast /4pc. 

From Table 11.3, the ABC interaction is estimated with the comparison 


lapc = abe +a+6+c—ab—ac — bc —- (1) 


Put the treatment combinations with a + coefficient—abc, a,b, and c—in one 
block and the treatment combinations with a — coefficient—ab, ac, bc, and (1)— 
in the other block. The two incomplete blocks of treatments are shown in Display 
112. 


ABC Interaction Confounded in Two Blocks 
of Four Experimental Units Each 


Display 11.2 


Block 


The comparison (block 1 — block 2) is the contrast required to estimate the 
ABC interaction. As a consequence, the estimate of the ABC interaction effect is 
completely confounded with a comparison between the blocks. It will not be possi- 
ble to estimate the three-factor interaction independent of block effects. 

On the other hand, the other six factorial effects are not confounded with 
blocks and can be estimated in the usual way. For example, the main effect of fac- 
tor A estimated from 


l, = (abe +a —b—c)+(ab+ac — be — (1)) 


contains two + coefficients and two — coefficients from the four units in each 
block. Any differences among the blocks will not affect the estimate. 
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Block Construction with the Evens—Odds Rule 


Half of the treatments have a + coefficient and half have a — coefficient for 
every effect in the 2” factorials. The treatments can be divided into the two groups 
with the Evens—Oadds rule. 

Any treatment combination that has an even number of letters from the fac- 
torial effect receives one of the coefficients (+ or —). Treatments with an odd 
number of letters from the factorial effect receive the other coefficient. 

The ABC interaction has the associated letters a, b, and c. Treatments (1), 
ab, ac, and be have an even number of letters from the ABC factorial effect. 
Treatment (1) with zero letters has an even number of letters, since zero is con- 
sidered an even number. Treatments a, b, c, and abc have an odd number of 
letters. The treatment combinations with the + coefficient for the ABC 
interaction effect are 


a, b, c, and abc 


and the treatment combinations with the — coefficient for the ABC interaction 
effect are 


(1), ab, ac, and be 


The treatment combination that contains all of the factor letters, abc in this 
case, will always have the + coefficient for any factorial effect. All of the treat- 
ment combinations in that group will also have the + coefficient. The treatment 
combinations in the other group will have the — coefficient for the factorial effect. 
This assignment of + and — coefficients for the contrast coincides with that 
shown in Table 11.3. 

Suppose the ABCD interaction from a 2‘ factorial is to be confounded with 
blocks for an experiment with two blocks of eight experimental units per block in 
each replication. The letters associated with the four-factor interaction effect 
ABCD are a,b,c, and d. The treatment combinations with an even number of 
letters are 


(1), ab, ac, ad, bc, bd, cd, and abcd 
and the treatment combinations with an odd number of letters are 
a, b, c, d, abc, abd, acd, and bcd 


The treatments in the group containing the treatment combination abcd receive a 
+ coefficient since the treatment combination abcd contains the letters of all four 
factors. The other group of treatments all have a — coefficient for the ABCD 
interaction effect and are placed in a separate block from the first group. 


Analysis of Variance Outline for a Completely Confounded Design 


The sums of squares are computed as usual for the analysis of variance except 
_ for the exclusion of a sum of squares partition for the interaction effect confounded 
with blocks. The block sum of squares will include the confounded factorial effect. 
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The sources of variation and degrees of freedom for the analysis of variance 
are outlined for the 2° factorial with b = 2 incomplete blocks in each of r = 2 
replicates in Table 11.4. Since ABC is confounded with blocks the sum of squares 
for blocks includes the ABC effect. 


Table 11.4 Analysis of variance for a 2° factorial with b = 2 incomplete blocks 
in each of r = 2 replicate groups 


Source of Degrees of 
Variation Freedom 
Replicates r—1l=1 
Blocks within replicates r(b—1)=2 
Treatments 6 
A l 
B | 
= l 
AB 1 
AC l 
BC ] 
Error 6 
Total 15 


Retain Some Treatment Information with Partial Confounding 


Block size was reduced in the previous section by confounding the highest order 
interaction with blocks. However, any gain that may occur with a reduced experi- 
mental error has its price. In the previous designs, the loss of total information on 
the confounded factorial effect was the cost for a possible reduction in the experi- 
mental error variance. 

Any factorial effect can be confounded with blocks. To avoid the loss of all 
information on any one factorial effect a different effect can be confounded in each 
replication group of the resolvable design. In this way a factorial effect is only con- 
founded in one of the replications and thus is said to be partially confounded with 
blocks. 

A 2° factorial with a different effect confounded in each of three replications is 
used to illustrate the principle in Example 11.2. 


Example 11.2 Partial Confounding in a 2° Factorial 


The purity of a chemical product was thought to be influenced by three fac- 
tors—agitation rate (A), base component concentration (8), and concentra- 
tion of reagent (C’). The chemist set up an experiment using a factorial 
treatment design with each of the factors at two levels for a 2° factorial 
arrangement. 
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Experiment Design: The chemist wanted three replications of the experi- 
ment, but only four runs of the chemical process could be conducted in a 
single day. Therefore, each replication had to be run in two incomplete blocks 
(days). 

An effect contrast for a 2° factorial consists of four treatment combina- 
tions with a + coefficient and four treatment combinations with a — coeffi- 
cient. Thus, it was possible to construct the incomplete block design by 
confounding a 2° factorial effect with blocks. To avoid complete confound- 
ing of one effect, the chemist confounded a different two-factor interaction in 
each replication. The treatment combinations required for each block are 
shown in the diagram in Table 11.5. 


Table 11.5 Observed purity of a chemical product in a partially confounded 2° 


factorial 
BC Confounded AC Confounded AB Confounded 
44 —] +1 —] a4 —] 


Block 1 Block 2 Block 3 Block 4 Block 5 Block 6 
Replicate I Replicate II Replicate IT 


The BC’, AC, and AB interactions were each confounded in one replication of 
the experiment. The two-factor interaction confounded with the two blocks in each 
replication is shown above the blocks. The observed purity rates are to the right of 
the treatment combinations in each block. 

The treatment combinations required in each of the incomplete blocks can be 
determined with the Evens—Odds rule. In replicate I the BC interaction is con- 
founded by placing treatment combinations with an even number of the letters b 
and c together in block 1. They are (1),a, bc, and abc. The treatments with an odd 
number of the letters 6 and c are placed in block 2. They are c, 6, ab, and ac. The 
treatments in block 1 have the +1 coefficient for the BC’ interaction effect 
because the treatment combination abc contains all of the factor letters. The treat- 
ments in block 2 have the — 1 coefficient for the BC’ interaction effect. Conse- 
quently, the difference between the observations in block | and block 2 will have 
the estimate of the BC’ interaction effect confounded with the difference between 
the effects of blocks 1 and 2. The treatment combinations in each of the blocks of 
replicates II and III can be determined in a similar manner with the AC’ effect con- 
founded in replicate II and the AB effect confounded in replicate III. 
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Computing the Sums of Squares for Partially Confounded Designs 


The sums of squares for the effects not confounded with blocks, A, B,C’, and 
ABC, may be calculated in the usual manner. The sums of squares can be calcu- 
lated from Equation (11.5) using the coefficients for the contrasts in Table 11.3. 
The effects can be estimated in all replications, thus the sum of squares is 
SS = 7 ky.) /2", where r=3, 2" = 8, and k; = + l-The calculations are 
shown in Table 11.6. 


Table 11.6 Computations for factorial effects not confounded with blocks 


BS ee i . Saas ae: Des ae 
Mean: 26 2. 3). a]. 32. 40... 36.35 ae, SS 


A — oe ee ee 
B ee eS 
@ MRE ge We SC. aa, SAORL See eee EN 
ABC so pe en pe ir sae hyena eee ee = 50) 


The sum of squares for each of the partially confounded two-factor interactions 
must be calculated from the replications in which they are not confounded. The 
sum of squares for AB must be calculated from replicates I and I, for AC’ from 
replicates I and III, and for BC from replicates II and III. Totals are used instead of 
the means for these calculations. 

The sum of squares can be computed conveniently by determining the effect 

contrast from all of the observations and subtracting the value of the contrast repre- 
sented by the difference of block totals for the replications in which the effect is 
confounded. The three confounded contrasts, AB, AC and BC, computed from the 
totals of all treatment observations are 


Treatments (1) a Pg Ee? Ngess soos abc 
Totals TS TOF Ve Tay 96 Ire 106. 12 Ukiy,; 


AB + - -— + + -— = + 18 
AC + - + = = + = + = ==24 
BC + + = = = = + + = =30 


The totals and difference computed for each pair of blocks are 


Replicate I Replicate II Replicate III 
BC Confounded AC Confounded AB Confounded 
Block 1 Block 2 Block 3 Block 4 Block 5 Block 6 
Totals 126 146 i3s 133 161 133 
Difference — 20 — 20 8 
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To estimate the AB effect contrast only from replicates I and II, the difference 
between blocks 5 and 6 (8) is subtracted from the AB contrast obtained from the 
totals of all observations, 18. The corrected estimate of the contrast is 


18-8 


5 
2 


lap = 


where the divisor is r = 2 to put the result on a mean basis. The sum of squares for 
AB interaction is 


2(5)° 
SS(AB) = = = 625 
Similarly, the estimate of the AC contrast is 
oe: ee ee 
1c], oes 
with sum of squares 
pe 
SS(C4Q) = =e1 
Finally, 
— 30 —(— 20) 
lee = po ae —5 
and 
2(-—5/ 
SS(BC) = == 6:25 


The analysis of variance is shown in Table 11.7. The design is a resolvable 
incomplete block design with blocks of treatments in complete replication groups. 
The total sum of squares and the sums of squares for replicates and for blocks with- 
in replicates are computed in the usual manner. The sum of squares for experimen- 
tal error is found by subtracting the sums of squares for all treatment effects, 
replicates, and blocks within replicates from the total sum of squares. 


Tests of Hypotheses About Factor Effects 


The critical value for a test of hypothesis for any factorial effect is F'o51.11 = 4.84. 
The agitation rate (A) and base component concentration (B) main effects are 
significant. None of the other effects had a significant effect on the purity of the 
chemical product. 

From Equations (11.3) and (11.4), the estimated effect of agitation rate is 


a 


pee 


10 
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Table 11.7 Analysis of variance for purity of chemical product in a partially 
confounded 2? factorial 


Source of Degrees of Sum of Mean 

Variation Freedom Squares Square Fo 
Replicates 1 111.00 55.50 

Blocks within reps 3 108.00 36.00 

A l 600.00 600.00 40.6 
B l 253.50 253.50 17.2 
GS l 54.00 54.00 at 
ABC l 13.50 1300 <3 
AB (Reps I, II) l 6.25 6.25 <1 
AC (Reps I, III) l 1.00 1.00 < |] 
BC (Reps I, II) ] 6.25 6.25 <4 
Error rt 162.50 14.77 

Total pa 1,316.00 


with standard error 


oo). 
sA = = : 


The estimated effect of the base component concentration is 


26 
B=—=65 
4 


with standard error sg = 1.57. The purity of the chemical product is increased by 
10 units if the agitation rate is increased from the low level to the high level. The 
purity is increased by 6.5 units if the concentration of the base component is in- 
creased from the low level to the high level. The reagent concentration C had no 
significant effect on the purity of the product, and there was no significant interac- 
tion among any of the factors. 


Confound Another Interaction to Further Reduce Block Size 


The number of experimental units per block can be reduced further by choos- 
ing a second factorial effect to confound with blocks. The design will have four 
blocks per replication if two factorial effects are used for confounding. 

The technique is illustrated with the 2° factorial to produce four blocks of two 
experimental units. If the interaction effects AB and AC are chosen as 
confounding effects, the assignment of + and — coefficients to the treatments by 
the Evens—Odds rule is 
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Gh} ic abe ra ab b ac a be 


Four blocks of two experimental units are formed by taking treatment pairs that 
have the same configurations of + and — coefficients for the AB and AC 
interactions. 

There are four configurations of + and — coefficients for the joint AB and 
AC’ interaction contrasts. Each configuration contains two treatments. The two 
treatments are each assigned to a separate block. The configurations and treatment 
assignments to blocks are 


AB AC Treatment Block 
+ i tl} abe ] 


+ _ c ab 2 
— + b ac 3 
— ~ a be 4 


A Third Effect Is Automatically Confounded 


If two effects are confounded with blocks in a 2” factorial, then a third effect is 
confounded. There are 3 degrees of freedom for block comparisons, and three treat- 
ment effects are confounded with blocks. The AB interaction is confounded with 
the block contrast 


AB=/1,=B,+B,.—B3;-— By 


where 5;, Bo, Bs, and By represent block means. The AC interaction is con- 
founded with the block contrast 


AC = 1, = B, — B, + B3 — By 


The third contrast among the blocks that is orthogonal to AB =], and AC = J is 


lj = By -Bo-B3+ By 
= (1)+ abe —c-—ab—b—ac+a+be 
If the contrasts in Table 11.3 are checked it can be seen that the third contrast is 


equal in the BC interaction. 


The Third Confounded Effect Is a Generalized Interaction 


The third effect confounded is known as the generalized interaction of the 
first two confounded effects. The generalized interaction of two effects is obtained 
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by forming the symbolic product of the two effects and striking out any letters that 
appear twice in the product. 

For example, AB and AC were chosen as the confounding effects. Their sym- 
bolic product is ABAC. Striking out the letter A, which appears twice, the result is 
ABAC = XBXC = BC. The generalized interaction is BC. Suppose ABC and 
AB are chosen as confounding effects. The generalized interaction is ABC AB 
— MRBCMRB = C. The main effect C is confounded with blocks along with ABC 
and AB. Care must be taken to avoid confounding effects that are of particular 
interest in the study, especially main effects. 

A table is given in Appendix 11A to aid in the construction of useful incom- 
plete block designs for 2” factorials. A general method of confounding that can be 
used with factorial systems other than the 2” series is presented in the next section. 


11.4 A General Method to Create Incomplete Blocks 


The allocation of treatment combinations to incomplete blocks for the 2” factorials 
has been accomplished thus far on the basis of the + signs of the effects con- 
founded with blocks. However, the method becomes rather cumbersome with many 
factors due to the large number of treatment combinations involved in defining the 
contrasts. In addition, the use of + signs for the confounding system does not 
carry over to other systems such as the 3” factorials. 

A general method to construct incomplete block designs with chosen factorial 
effects confounded with blocks utilizes the mathematics of residues modulo m or 
residues mod m. For an integer k the residue mod m is the remainder when k is 
divided by m. The residue r for the integer k mod m is written as k = r(mod m). 

With 2” factorials we work with residues of (mod 2). Any integer divided by 
m = 2 leaves a remainder of 0 or 1. Thus, the values for the residues (mod 2) are 0 
and 1. The even integers (mod 2) have residue 0 and the odd integers (mod 2) have 
residue 1. For example, 7 = 1(mod 2) since 7 divided by 2 is 3 with a remainder of 
1. Also, 4 = O(mod 2) since 4 divided by 2 is 2 with a remainder of 0. 

The levels of a factor (0 or 1) are used as the values of a variable zx; repre- 
senting the ith factor. A treatment combination for the 2” factorial is represented as 
the sequence 21 22%3---2,. For a 2° factorial the eight treatments written in standard 


order are 
a Treatment 

0 0 0 (1) 

] 0 0 a 

0 l 0 b 

l l 0 ab 

0 0 ] c 

1 0 ] ac 

0 l l be 

l | 1 abc 
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A Defining Contrast for Two Blocks 


A general method to determine the allocation of treatment contrasts to incomplete 
blocks is accomplished with a linear function 


L=aj4%, + a9%q +++: + An2n (11.6) 


where L is the defining contrast, or the contrast confounded with blocks. The value 
of a; is 1 if the ith factor is present in the defining contrast and 0 if the factor is ab- 
sent in the defining contrast. The defining contrast function L is evaluated for each 
treatment combination. The value of x; is the level of the ith factor (0 or 1) in any 
treatment combination under consideration for allocation to an incomplete block. 

Suppose the defining contrast for a 2° factorial in two blocks of four units each 
is the two-factor interaction AB. Factors A and B are present in the defining con- 
trast so that a; = 1 and ag = 1. Since factor C’ is not in the defining contrast, 
a3 = 0. The defining contrast function for allocation of treatments is 


= Li + Xa (11.7) 


The values of x; and x» for each treatment combination are substituted into L and 
the residue for L modulo 2, r(mod 2), is determined. For example, the value of 
L = 21+ 22 for the treatment combination bc, 1,273 = (011), is LD = 0+ 1. The 
residue for L = 1 is 1(mod 2). The values of L and residues, L = r(mod 2), are 


Treatment 2, So. ole = Bact Fo>-4(mod 2) 
(1) 0 0 0 0 
a | 0 | 1 
b 0 l | 1 
ab 1 1 2 0 
Cc 0 0 0 0 
ac ] 0 ] ] 
be 0 l l l 
abc 1 | 2 0 


The treatment combinations with L = 0(mod 2) are assigned to one block, and the 
treatment combinations with L = 1(mod 2) are assigned to the other block. Thus, 
the treatment assignments are 

Block 1 (lig... ape with L = 0 (mod 2) 

Block 2 O-eae:- Pe with L = 1 (mod 2) 


Let B, and Bz represent the block totals. The block with residue 0 for L, block 
1, contains the treatment combinations with a + sign for the AB contrast, while 
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block 2 with residue 1 for L contains the treatment combinations with a — sign for 
the AB contrast. Thus, the contrast / among block totals equivalent to the AB 
contrast is 


lap = B, — Bo 


Use Two Defining Contrasts for Four Blocks 


The designs to this point have been constructed for 2” factorials with two incom- 
plete blocks of 2”~' experimental units, each with one defining contrast con- 
founded with blocks. The use of two blocks with one factorial effect confounded 
may not reduce block size sufficiently for a 2” factorial when there are many fac- 
tors, say n > 4. For example, a 2° factorial with 32 treatments may require block 
sizes no larger than eight units per block with four blocks per replication. Further 
reductions in block size can be accomplished by confounding an additional defin- 
ing contrast with blocks. 

Suppose a 2‘ factorial is to have the 16 treatment combinations placed in four 
blocks of 24~2 = 2? = 4 experimental units each. Two defining contrasts are re- 
quired to construct the four incomplete blocks. If AB and CD are chosen to be 
confounded with blocks, then the defining contrasts are 


Ty = 227i + £9 (11.8) 


Ig =%34+2%4 


where L, represents AB and L» represents C'D. Each treatment combination will 
provide a pair of residues modulo 2 for the pair (LZ, L2). There are four pairs of 
residues possible—(0,0), (0,1), (1,0), and (1,1). Treatment combinations with the 
same values for a pair of residues modulo 2 are placed in the same incomplete 
block. The values for the defining contrasts, L,; and L» in Equation (11.8), and the 
residue pairs for each of the 16 treatment combinations are shown in Display 11.3 
along with the block assignments. 

The 3 degrees of freedom among the blocks represent three orthogonal con- 
trasts among the blocks. Two of the contrasts are known to include the two-factor 
interactions, AB and CD, chosen to be confounded with blocks. The contrast for 
the AB interaction is the difference between the blocks with residues 0 and 1 for 
Ly. Let By, Bo, Bz, and By, represent the block totals. The block contrast for AB is 


lap = Bi + Bo— B3 — Ba 


because blocks 1 and 2 have residue 0 and blocks 3 and 4 have residue | for 1. 
Similarly, the blocks with residues 0 and 1 for L define the contrast for the CD 
interaction. The block contrast for C’D is 


lop = Bi — Bo + Bs — By 


because blocks 1 and 3 have residue 0 and blocks 2 and 4 have residue 1 for Lz. 
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Display 11.3 Incomplete Block Design for a 24 Factorial in Four Blocks 
of Four Units with Defining Contrasts AB and CD 


Block Treatment 2, 2 23 4 Ly Le Residue 
| (1) O 20sec Seus® O20 (0,0) 
ab besk 2c§ .2hp 2 2550 (0,0) 
cd GO a4ss4 Ou 92 (0,0) 
abcd pe seep tees £9 | Sage (0,0) 
2 e ee tae ee G 1 (0,1) 
d Cae Bes 1 (0,1) 
abc SD ee (0,1) 
abd ope) 1 > ee (0,1) 
3 a a ee ee eee (1,0) 
b ey 0. fp (1,0) 
acd ae oe ee ie: (1,0) 
bed ee Se eee Ee: (1,0) 
4 ac =) l | (1,1) 
be ees ee See | (7.0) 
ad ee Be re l 1 (1,1) 
bd ee ee 1 l et) 
Ly = 13+ 2X4 


Determining the Generalized Interaction 


The ABC D interaction is the generalized interaction confounded as a consequence 
of purposely confounding AB and CD with blocks. The generalized interactions 
can be determined by more formal algebraic rules than those described in the previ- 
ous section. Form the product of the symbols for the defining contrasts with the 
exponent of any symbol reduced modulo 2. The product of AB by CD is 


AB x CD'= ABCD 


The generalized interaction of AB and C'D as determined by the symbol product is 
ABCD since all exponents of the symbol product are 1(mod 2). 

Suppose the contrasts ABC’ and BC'D had been chosen for the original defin- 
ing contrasts. The product is ABC x BCD = ABBCCD = AB?C?D = AD, 
where B? and C? do not appear in the generalized interaction because their expo- 
nents are O(mod 2). Therefore, the generalized interaction is AD when ABC and 
BCD are the defining contrasts. 

Because main effects and two-factor interactions are of greatest interest in pre- 
liminary studies of factor effects, we try to avoid confounding them with blocks. 
The number of experimental units in an incomplete block design for 2” factorials is 
equal to the powers of 2—2, 4, 8, and so forth—up to 2”~!. In general, if blocks of 
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size k = 29 are required there will be 2"/22 = 2”~4 blocks in a complete replica- 
tion. Consequently, n — q defining contrasts must be chosen. 

Consider the 2’factorial with 128 treatment combinations. A design with 
24= 16 experimental units per block requires 2’-*=2?=8 blocks and 
(7 — 4) =3 defining contrasts. Choose ABG, CDE, and EFG as the defining 
contrasts to reduce block sizes. The first contrast, ABG, is used to reduce block 
sizes to 64 units. The second, C DE, is used to reduce block sizes to 32 units, and 
the third contrast, HFG, is used to reduce block sizes to 16 units. There will be 
four generalized interactions also confounded with the eight blocks. They are 


ABG x CDE = ABCDEG 
ABG x EFG = ABEFG = ABEF 


CDE x EFG =CDE?FG =CDFG 
and | 


ABG x CDE x EFG = ABCDE* FG? = ABCDF 


As before, the treatment combinations are assigned to the blocks according to 
the residues of the defining contrasts 


ABG et Ty =s.0p se bez 
CDE ee Tog = 73 + 444+ 2X5 
EFG P D3 = 245+ 2% + 27 


Each treatment contrast will have a triplet of values (L1, L2, D3). There will be 
eight triplets of residues—(0, 0, 0), (0, 0, 1), (0, 1, 0), (1, 0, 0), (0, 1, 1), C1, 9, 1), 
(1, 1, 0), and (1, 1, 1)—each representing the residue triplet for a block assignment 
of the treatment combinations. A treatment combination with the residue triplet 
(0, 0, 0) is assigned to one block, a treatment combination with the residue triplet 
(0, 0, 1) is assigned to a second block, and so forth. 

Table 11A.1 in the Appendix lists defining contrasts and their generalized 
interactions to construct incomplete block designs for 2” factorials with block sizes 
k > 4 for n = 4,5,6, and 7 factors. Notice that some two-factor interactions will 
be confounded with blocks when blocks have four or less units. Designs with only 
three-factor and larger interactions confounded are always possible with block sizes 
of eight or more. If the defining contrasts are properly chosen it will be possible to 
avoid confounding any two-factor interactions or main effects. | 
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11.5 Incomplete Block Designs for 3” Factorials 


The 2” factorials are useful designs to detect factors with major effects on the mea- 
sured responses in an experiment. The 3” factorials have three levels for each fac- 
tor, making it possible to estimate linear and quadratic trends for quantitative 
factors and to provide more detailed descriptions of qualitative factor effects. How- 
ever, the number of experimental units required by 3” factorials increases by 
powers of 3 as more factors are added. Thus, incomplete block designs can be very 
useful with these treatment designs. The construction of incomplete blocks designs 
for 3” factorials is discussed briefly in this section. 


Some 3” Basics 
Notation for 3” Factorials 


The levels of a factor are represented by x; = 0, 1, 2. For example, the nine 
treatment combinations for a 3? factorial with factors A and B are 


The three columns of the array represent treatment combinations for the three 
levels of factor A, and the three rows represent treatment combinations for the three 
levels of factor B. 


Three Incomplete Blocks Required for Confounded 3” Factorials 


The construction of incomplete block designs for 3” factorials requires three 
blocks to have blocks of equal size. There will be 2 degrees of freedom between 
blocks, and a treatment effect with 2 degrees of freedom must be confounded with 
blocks. 

The 3” factorials have 2 degrees of freedom for main effects, 2? degrees of 
freedom for two-factor interactions, and so forth. We would not want to confound 
the main effects with blocks. Instead, the interactions are partitioned into two 
orthogonal components, each of which has 2 degrees of freedom. The orthogonal 
component can then be used as the defining contrast in the construction of the 
incomplete block design. 


Construct Incomplete Blocks with Defining Contrasts 


The allocation of treatment combinations to blocks utilizes the defining contrast 
function L = a,x; + aor +++: + Qn2, introduced in Section 11.4. The value of 
x; for the 3” factorial is x; = 0, 1, 2. As a mathematical convenience, the values of 
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a; = 0,1,2 are used for factors included in the 2 degrees of freedom interaction 
used as the defining contrast. For example, with two factors, A and B, the AB 
interaction with 4 degrees of freedom is split into defining contrasts labeled AB 
and AB’, each with 2 degrees of freedom. By convention, the first letter of the 
interaction expression always has a power of 1 so that A?B or A? B? are not used as 
defining contrast expressions. The powers of A and B are the coefficients a, and 
a in the defining contrast function L = a,27| + ao%. 

The modulo 3 residues of L = ayx1 + Q9%2 +---+QnZpy are determined for 
3” factorials since there are three levels for each of the factors. The residues r(mod 
3) are 0, | and 2. The defining contrast function for AB with a; = az = 1 is 


Ty = 27 + 5 
and for AB? with a; = 1 and a2 = 2 the defining contrast function is 
Lo = 2 +22 


The treatments with L; = O(mod 3) are assigned to one block, those with 
L;, = 1\(mod 3) are assigned to a second block, and those with L; = 2(mod 3) are 
assigned to the third block. The treatments are assigned to blocks according to the 
value of the residues for L,; or Ly. The common practice is to have one-half of the 
replications assigned according to each of the defining contrast functions. The treat- 
ment assignments for L; and Lz are shown in Display 11.4. 


Display 11.4 Incomplete Block Design for a 3? Factorial with the 
AB or AB? Component Confounded with Blocks 


AB confounded 
[Ly =2%14+ 22 


Block 1 Block 2 Block 3 
00 i zt 01 10 22 02 11 20 


Ly = O(mod 3) diy = 1m 3) Ly = 2(mod 3) 


AB? confounded 
Lo =a27+ 229 


Block 1 Block 2 Block 3 
00 11 Ze 02 10 21 01 12 20 
Ly = O(mod 3) L2 = (mod 3) D2 = 2(mod 3) 
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The residues of L, and Lz for the nine treatment combinations are 


XL} x2 L;  r(mod 3) Ly ~~ r(mod 3) 
0 0 0 0 0 0 

0 l ] ] 2 2 

0 2 2 2 4 l 

] 0 ] 1 | l 

l l 2 Z 3 0 

l 2 3 0 5 2 

2 0 2 2 Zz 2 

2 | 3 0 4 ] 

. ys 4 | 6 0 


Confounding with Three or More Factors 


The 3° factorial requires 27 experimental units for a single replication. An incom- 
plete block design with three blocks of nine experimental units each can be con- 
structed by confounding a three-factor interaction component with blocks. The 
three-factor interaction with 8 degrees of freedom has four components, each with 
2 degrees of freedom. For the purpose of obtaining a defining contrast the four 
components are designated ABC, ABC”, AB?C, and AB?C”. The defining con- 
trast for each of these components is 


ABC a L=2,+2%2.4+ 23 
ABC -- DL=2,+%.+223 
ABC == L=2,+222+ 23 
AB-C* ae D=2,+2% +223 


Any component of the three-factor interaction can be used to generate three 
blocks of nine experimental units each. If the component AB?C is used, the defin- 
ing contrast is 


DL=2,+22%2.+ 23 


The three blocks are constructed with treatment combinations 27127273 that provide 
the residues for the defining contrast of L =0O(mod 3), ZL = 1(mod 3), and 
L = 2(mod 3), respectively. 

Residues are determined for each of the 27 treatment combinations—000, 001, 
002, 010,... ,222. For example, the treatment combination 000 has residue 
L =0+2(0) + 0 = 0(mod 3) and the treatment combination 021 has residue L = 
0 + 2(2) + 1 = 2(mod 3). Each is placed in its respective block with other treat- 
ments that have the same residues for the defining contrast. A different component 
of the three-factor interaction can be confounded in each replication of the 
experiment. 


386 CHAPTER 11 


INCOMPLETE BLOCK DESIGNS: FACTORIAL TREATMENT DESIGNS 


Four-factor interaction components are confounded with blocks in 34 
factorials. The eight components each with 2 degrees of freedom are ABC D, 
ANCD, ABCD ABCD. 2p CP ARC. ABCD. ac APC OO. 
Any of these components can be used to block the 81 experimental units into three 
blocks of 27 experimental units each. 


Two Generalized Interactions Occur with Further Reductions in Block Size 


The number of incomplete blocks remains a multiple of 3 with the 3” factorials 
upon reduction of block sizes. A second defining contrast is confounded with 
blocks to reduce block sizes from nine experimental units to three experimental 
units for a 3° factorial. In the case of 3” factorials, there are two generalized inter- 
actions for each pair of defining contrasts. 

If two defining contrasts are chosen, say X and Y, then the generalized inter- 
actions are given as the symbolic products XY and XY%. The exponents of the 
symbolic products are reduced modulo 3 to derive the generalized interactions. 
Suppose the two-factor interaction components X = AB and Y = AC’ are used as 
defining contrasts. 

The XY symbolic product is 


XY BAB X AC? = A’ BC" 


Since the exponent of the first term of the symbolic product A is not unity, upon re- 
duction modulo 3, the product is squared and the exponents are reduced modulo 3 
as follows: 


(A?BC’) = A*B?C* = AB’C 


where the last term results from reducing the exponents of the previous term 
modulo 3. 
The XY? symbolic product is 


XY? = ABx (AC’)? = AB x A?C* = A? BC" 
and upon reducing the exponents of the last term modulo 3, the result is 
AY? =e BC 


Upon choosing the two defining contrasts, AB and AC”, two generalized 
interactions, AB?C and BC, are also confounded with the nine blocks of three 
experimental units each. Care must be exercised in the choice of defining contrasts 
to avoid confounding a main effect with blocks. For example, the choice of two 
three-factor interaction components, ABC and AB?C%, results in a generalized 
interaction of 


RY, ABC YAR C= AEC: (AE = ABC’ 


When the exponents of the squared symbolic product are reduced modulo 3 the 
generalized interaction is the main effect A. 
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Suppose the two interaction components AB and AC? are chosen to construct 
nine blocks of three units each for the 3° factorial. The defining contrasts are 


Ly = 214+ 22 
and 
Lo = Tj + 223 


There will be nine pairs of residues modulo 3 from the two defining contrasts, and 
three treatment combinations will be associated with each of the pairs of residues. 
For example, the residues (1, L2) = (2,2) occur with treatment combinations 
(021), (112), and (200). Those three treatment combinations are placed in the same 
block. 


11.6 Concluding Remarks 


As we have seen in this chapter, the factorials are versatile treatment designs that 
can be adapted to a variety of experimental blocking conditions. The use of incom- 
plete block designs results in some factorial effects confounded with blocks, either 
completely or partially, and increased complexity in the analysis of the results with 
the partially confounded designs. The goal of reducing block size is to reduce the 
experimental error variance. If the reduction in the estimate of experimental error 
variance is sufficient to overcome the loss in some information on the confounded 
factorial effects, then the use of an incomplete block design has been worthwhile. 
Cochran and Cox (1957) have tabled a number of incomplete block designs for 
symmetrical and asymmetrical factorials and provide some examples of the analy- 
sis with several of these designs. Detailed information on the underlying principles 
of design construction can be found in Kempthorne (1952), John (1987), and John 
and Williams (1995). 


EXERCISES FOR CHAPTER 11 


1. Construct one replication of an incomplete block design for a 2* factorial in two blocks of eight 
experimental units each with ABC'D confounded with blocks. Randomly assign the treatments to 
the eight experimental units in each block. 


2. Suppose an incomplete block design is required for a 2° factorial, and the blocks cannot exceed 
eight experimental units in size. 
a. How many blocks will there be? 
b. How many defining contrasts are required for one replication of the experiment? 
c. How many other effects, generalized interactions, will be confounded with blocks? 
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d. Choose some defining contrast(s) to construct one replication of the design, and determine 
what other interaction(s), if any, also will be confounded with blocks. 


3. Four replications of a 2* factorial experiment are required. The design must be conducted with 
incomplete blocks of eight experimental units. Construct a design such that no effect is completely 
confounded with blocks in the experiment. 


4. A 2" factorial experiment is to be conducted in eight blocks of 16 experimental units, each using 
ABCD, ABEF, and EFG as defining contrasts. What other effects are confounded with blocks? 


5. Suppose a 2° factorial experiment was conducted in eight blocks of eight experimental units each, 
and BCDE, ABDE, and ADE were used as defining contrasts. 
a. What other effects were confounded with blocks? 
b. Could there have been a better choice of defining contrasts for this design? Explain. 
c. If you have decided that a better choice is possible, then choose a set that will improve the de- 
sign and defend your choice. 


6. An animal scientist conducted a study on the effects of heat stress and dietary intake of protein and 
saline water on laboratory mice. The three factors were each used at two levels in a 2° factorial 
arrangement. The levels for the factors were (A) protein (low, high); (B) water (normal, saline); 
and (C’) heat stress (room temperature, heat stress). An incomplete block design was used with four 
mice from an individual litter used in each block. Each mouse was put in an individual cage and 
assigned one of the treatments. One replication of the experiment consisted of two litters of mice. 
The weight gains (grams) for the mice are shown for each mouse next to the treatment 
identification. 


Litter 1 Litter 2 Litter 3 Litter 4 Litter 5 Litter 6 


Replicate I Replicate I Replicate III 


a. Which treatment effects are confounded with blocks (litters)? 

b. Estimate the factor effects and interactions and their standard errors, and compute the analysis 
of variance for these data. 

c. Interpret the results. 


7. An experiment was conducted to investigate the effects of four factors on the operation of a metal 
lathe. The four factors each at two levels were (A) speed of lathe rotation (60, 75); (B) angle of 
cut (30, 45); (C) frequency of lubrication (10 sec, 30 sec); and (D) alloy for cutting tip (1 and 2). 
The factor levels were used in a 2* factorial arrangement for the experiment. Only eight cutting 
trials could be run in a single day. An incomplete block design with two blocks (days) in each of 
two replications was set up with ABC D as the defining contrast. The cutting tip wear for each of 
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the treatments in each block follows. Days 1 and 2 constitute the first replication and Days 3 and 4 
constitute the second replication. 


Cutting Tip Wear 
Day 1 Day 2 Day 3 Day 4 


a. Compute the analysis of variance for the data. 
b. Compute the factor effects and their standard errors. 
c. Interpret the results. 


Design an incomplete block design for a 3° factorial that has nine experimental units per block in 
four replications of three blocks each. Confound a different component of the three-factor interac- 
tion in each replication. 


An investigator needs an incomplete block design for a 3° factorial that has three experimental 
units in each block. Consider the two pairs of defining contrasts to construct the design: 

av ABC, AR 

b. AB ae 

Which is the better pair of defining contrasts to construct the design? Explain. 
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11A Appendix: Incomplete Block Design Plans 


for 2” Factorials 


Table 11A.1. Number of factors and blocks, block sizes, defining contrasts, and 
generalized interactions to construct incomplete block designs with 2” factorials 


Factors Blocks 


n Zz os 
4 2 
4 
5 As 
4 
8 
6 Z 
4 
8 
16 
7 2 
4 
8 
16 
32 


Block 
Size 
kv. 27 


8 
4 


16 


8 
4 


32 


16 


Defining 

Contrasts 
ABCD 
ABC, ABD 


ABCDE 
ABC, CDE 
ABC, ACD, 
ADE 


ABCDEF 


ABCD, CDEF 


ACE, ABEF, 
ABCD 
ABF, ACF, 
CDP, DEF 


ABCDEFG 
ABCDE, 
ABEFG 
ABG, CDE, 
EFG 

ABC, ADG, 


CDE, DEFG 


ABG, BCG, 
CDG, DEG, 
EFG 


Generalized 
Interactions 


€D 


ABDE 
BD, CE, ABE, BCDE 


ABEF 
ADF, BCF, BED, CDEF 


AD, CE, BC, BE, AEF, 
BDF, ABCD, ABDE, 
ACDE, ABCEF, BCDEF 


CDFG 


ABEF, CDFG 

ABCDF, ABCDEG 

AEF, BDF, BEG, CFG, 
ABDE, ABFG, ACDF, 

ACEG, BCDG, BCEF, 
ABCDEFG 

AC, BD, BF, DG, CE, 

ADG, AFG, BEG, CFG, 
ABCD, ABEF, ABFG, ACDF, 
ABDE, ACEG, ADEF, BCDE, 
BCEF, CDEF, ABCEG, 
ABDFG, ACDEG, ACEFG, 
BCDFG, BDEFG, ABCDEFG 


12. Fractional Factorial Designs 


Discussions on the versatility of 2” factorial treatment designs are continued in this 
chapter. Experiments using only a fractional replication of the factorial arrangement 
are proposed as a means of effectively obtaining information on factors in the early 
stages of experimentation. Methods for constructing the designs are described and 
the analysis to estimate and test significance of factorial effects is illustrated for 
large 2” factorial experiments without complete replication. 


12.1 Reduce Experiment Size with Fractional 
Treatment Designs 
The 2” factorial treatment designs are useful for conducting preliminary studies 
with many factors to identify the more important factors and factor interactions. 
However, the number of experimental units increases geometrically with the num- 
ber of factors in the study. 

Fractional factorial designs use only one-half, one-fourth, or even smaller 
fractions of the 2” treatment combinations. They are used for one or more of sever- 
al reasons, including 

e the number of treatments required exceeds resources 

e information is required only on main effects and low-order interaction 

e screening studies are needed to check on many factors 

e an assumption is made that only a few effects are important 


Some industrial research and development studies can exceed the capacity of 
the research facility if all treatment combinations are included in the experiment for 
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a 2” factorial. For example, one complete replication of a 2° study requires 128 
runs of a process. The observations from the 128 runs provide estimates of 7 main 
effects, 21 two-factor interactions, 35 three-factor interactions, and 64 interactions 
including four or more factors. The 128 treatment combinations provide a large 
amount of information, perhaps even more than is necessary for high-order 
interactions. 


Reasons for Fractional Factorial Use 


Design Redundancy in the Absence of High-Order Interactions 


The experiences with studies involving many factors have led to the observa- 
tion that at some point higher order interactions tend to become negligible and may 
be ignored in the overall scheme of preliminary investigations. In practice main ef- 
fects generally tend to be larger than two-factor interactions, which in turn tend to 
be larger than three-factor interactions, and so forth. 

The complete factorial has a degree of redundancy if the investigator can be 
reasonably assured that the high-order interactions are negligible. Under these cir- 
cumstances estimates of main effects and low-order interactions can be obtained 
from a fraction of the full factorial treatment design. 


The Factor Sparsity Hypothesis 


The use of fractional factorial designs in industrial research (Diamond, 1989) 
or biotechnology (Haaland, 1989) rests to a large degree on a factor sparsity 
pistes ws & Ss sss Be assumes a Pn see ameaemegpeaal 


Screening Studies with Fractional Factorials 


The fractional factorial also is useful in screening studies that include “major” 
factors and “minor” factors. The effects and interactions associated with major fac- 
tors are the primary objective of the study. However, the study can include a num- 
ber of minor factors that must be checked for their effects even though most or all 
are expected to be negligible. The features, construction, and analysis of fractional 
factorials are explored in this chapter beginning with the half fraction experiment in 
the next section. 
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12.2 The Half Fraction of the 2” Factorial 


The half fraction design is referred to as a 2”~' fractional factorial design because 
52" = 2”—!. The notation indicates the design includes n factors each at two levels 
that use only 2”~! experimental units. 

When one replication of 2” factorial was placed into two incomplete blocks in 
Chapter 11 a defining contrast was used to separate the treatment combinations into 
two sets. Each of the two sets was placed into one of the incomplete blocks accord- 
ing to the + and — coefficients of the treatment combinations in the defining 
contrast. Each of the blocks was h a me a ene — of the treatments. 

ist was confounded with blocks, it was possible to esti- 


The same principle is used to construct fractional factorial designs. A half 
replicate of the design consists of all treatment combinations with the + coeffi- 
cient of the defining contrast. 


Example 12.1. Truck Leaf Spring Manufacture Revisited 


Recall from Example 11.1 that an assembled truck leaf spring was passed 
through a high-temperature furnace. Afterward, it was put into a forming 
machine to induce curvature in the spring by holding the spring in a high- 
pressure press for a short length of time. The factors and levels for the 2° 
factorial were (A) furnace temperature at 1840° and 1880°F, (B) heating 
time at 23 and 25 seconds, and (C’) transfer time at 10 and 12 seconds. 

The eight treatment combinations, spring quality observations, and coeffi- 
cients to estimate the factorial effects are shown in Table 12.1. Notice the 
eight treatments are divided into two groups of four treatments using the de- 
eas contrast | based on ine ABC, three-factor interaction. This particular 

erinienantl ope’ been ‘ional construct ami incomplete block | 


Table 12.1 Treatment combinations required for a 2° factorial treatment design 


Factorial Effect 
Treatment I A B ben reO Hk) nit oa ABC y 
a fF + ean _ = + Sia “fe = 
b 4 ee cs of es Sa es as 28 
C “tp a - + + =“ = + 48 
abc af “t ne + + + ae fe 29 
(1) + oe = = + “3 “ aa 32 
ab - + ss = + a a — 31 
ac tr ch = +: oi = e a ey 
be rf =: r + ae a a ms 28 
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If the engineers wanted to construct a half replicate fractional factorial design 
they would use the four treatments in the top half of Table 12.1 with a + coeffi- 
cient for the ABC factorial effect. 


Gains and Losses with the Fractional Factorial 


Each Treatment Effect Has an Alias 


The gain in reduced size of the experiment comes with a price—the loss of 
some information on = treatment effects. If a half ig eens of the 2° factorial is 
_ we lose re aes estimate the three-factor interaction and each main_ 

reed oe cd ee The contrasts for the 


main oe ae A B. aad C are 


lI, =a—b—c+abe = 35 — 28 —48 +29 = — 12 
Ip = —a+b—c+abe = — 354+ 28 —48 +29 = — 26 
Ic = —a-—b+c+abe = —35—28+ 484+ 29 = 14 


Contrasts for the BC’, AC, and AB interaction effects are 


lpg =a—b—c+abc = 35 — 28 —48 +29 = — 12 
lag = —at+b—ct+abe= —35+4+28 —48+29 = — 26 
lap = —a—b+c+abe = —35—284+48+29 = 14 


Notice that 14 =/sc and the contrast 1 = (a—b—c+abc) estimates the com- 
bined effect A + BC, making it impossible to differentiate between the main effect 
of temperature (A) and the interaction between heating time and transfer time 
(BC). Two effects estimated by the same contrast are known as aliases. 

Similar relationships exist with the linear contrasts for the main effects of heat- 
ing time (B) and transfer time (C), 1p = lac and Io = I,p. The linear contrast lp 
estimates B + AC and Ic estimates C + AB. Therefore, B and AC are aliases and 
Cand AB are aliases. The alias relationships are 


A= BC 
AC 
C=AB 


The ABC interaction used as the defining contrast is known as the design gen- 
erator. The three-factor interaction ABC’ used as the defining contrast cannot be 
estimated from that half replication because it has the same coefficients (+ ) in 


12.2 THE HALF FRACTION OF THE 2” FACTORIAL 395 


Table 12.1 as the estimate of the mean, or the identity column J. The identity 
relationship 7 


i Ar 


is known as the defining relation for the design. 


Generate the Design with Either Half of the Design Generator 


Either half of the ABC contrast can be used for the design generator. If we use 
the treatment combinations with a — coefficient for the ABC interaction from the 
bottom half of Table 12.1 as the half replication, the defining relation for the design 
is J =—ABC. The contrast for the main effect of temperature (A) is 


l,4 =[—(1)+ab+ac — bc] 
The contrast for the interaction between heating time and transfer time (BC) is 
lpc = [() — ab — ac + bc] 


Thus, 14 = —I1gc; the temperature main effect is the negative of the heating time 
and transfer time interaction effect. 

Similarly, the main effects of heating time (B) and transfer time (C) are the 
negative of the AC’ and AB interactions, respectively. The relationships between 
the main effects and two-factor interactions are A=—BC, B=-—AC, and 
C =—-AB. Regardless of which design half is used, the main effects are aliased 
- with the two-factor interactions. 

The alias for any treatment effect can be determined readily from its gener- 
alized interaction with the defining contrast ABC’. (The determination of gener- 
alized interactions was explained in Chapter 11.) The aliases for the main effects 
are determined from their respective products with the defining contrast as 


Ax Bb =/°RC = BC 
Bx ABC =ARC = AC 
CHXABC = APC? = AB 
Had the defining relation J = —ABC been used, the aliases would be 
A x (-ABC) = -A’?BC = -BC 
B x (-ABC) =-AB’*C =-AC 
C x (ABC) =-ABC? =-AB 


Redundancy in 2” Factorials 


When there are no two-factor or three-factor interactions in the 2° factorial, the 
main effects for A, B, and C can be estimated from a half replicate of the design 
defined by either J = ABC or J =—ABC. Thus, in the absence of interaction 
there is a redundancy in the complete design such that main effects can be 
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estimated with two different contrasts, each from a different half of the design. The 
half fraction derived from J = ABC’ is commonly known as the principal fraction, 
while the half fraction derived from J = — ABC is the complementary fraction. 

The two half fractions form a complete 2° design. If each is run on a separate 
occasion the resulting design is an incomplete block design with two blocks of four 
treatments each. The ABC interaction is confounded with blocks, but all main 
effects and two-factor interactions can be estimated. 


How to Construct Half Replicate 2”~ + Designs 


The half fraction design is constructed with the highest order interaction as the de- 
sign generator. The treatment combinations are identified as follows: 


e Write the + and — coefficients in standard order for the factors in a 2”7! 
factorial. 


e Identify the + coefficients for the nth factor by equating them to the 
coefficients for the highest order interaction in the 2”~! factorial. 


The construction of a half replicate 24~! design is illustrated in Table 12.2. The — 
and + coefficients for the main effects of A, B, and C for the 24~! = 2° factorial 
are written in standard order. The coefficients for the ABC interaction are taken 
as the level of the fourth factor, D, to be combined with the levels of the three 
other factors. The coefficients for the ABC interaction contrast are obtained as the 
product of the coefficients for the three main effects, as shown in the fourth column 
of Table 12.2. 


Table 12.2 Construction of a 2*! fractional factorial design 


A B & D=ABC_ Treatment 
= - : = ) 
< + - ss ad 
= + = + bd 
as a - = ab 
= = - 5 cd 
+ = — - ac 
_ +- + — bc 
+ a3 + — abcd 


For example, the product of the coefficients for A, B, and C in the first row of 
coefficients is a — sign. Thus, the first row of coefficients is( — — — — ) for the 
treatment combination (1). The second row of coefficients is (+ — — +) for the 
treatment combination ad. 
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Build the Half Replicate with the Evens—Odds Rule 


The highest order interaction is the design generator on the basis of the Evens— 
Odds rule. The resulting treatment combinations required for the half replicate are 
shown in the right-hand column of Table 12.2. They are the treatment combinations 
that receive a + coefficient in the ABCD interaction by the Evens—Odds rule 
since each has an even number of effect letters. 

A table of design generators for fractional factorials is given in the Appendix. 
The generator for the 24! design in Table 12.2 is listed in Appendix Table 12A.1 
as D = ABC to indicate how the levels for the fourth factor, D, are determined 
from the levels of the other factors. 

The design in Table 12.2 is a complete factorial for A, B, and C if factor D is 
omitted from the design. Regardless of which factor is omitted from the half frac- 
tion design, the resulting design is a complete factorial in the remaining effects. 


Choose the Highest Order Interaction for the Design Generator 


The highest order interaction of least interest ordinarily is used to generate the 
half replicate because the defining contrast chosen for the design generator cannot 
be estimated. Suppose a half replicate of a 2° factorial is generated with the 
ABCDE contrast. The design requires 2°-! = 16 experimental units. The com- 
plete set of aliases is shown in Table 12.3. 


Table 12.3. Aliases for main effects and two factor interactions in a half fraction 
of the 2° factorial design 


Main Two-Factor 

Effects Alias Interactions _ Alias 
A BCDE AB CDE 
B ACDE AC BDE 
C' ABDE AD BCE 
D ABCE AE BCD 
E ABCD BC ADE 
BD ACE 

BE ACD 

CD ABE 

CE ABD 

DE ABC 


Each main effect has a four-factor interaction as an alias, and each two-factor 
interaction has a three-factor interaction as an alias. For example, the alias of B is 
found as the generalized interaction B x ABCDE = AB*CDE = ACDE, and 
the alias of DE'is DE x ABCDE = ABCD*E? = ABC. 
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Use the Fractional Factorial to Guide the Experimental Process 


A sound experimental strategy in industrial research will utilize fractional fac- 
torial designs. For example, a half replicate of the 2° factorial requires 16 runs of an 
experimental process. Suppose the first 16 required runs were completed. The sec- 
ond fraction of 16 runs can always be run later in a second block if necessary to 
complete the factorial. There may be sufficient information on main effects and 
two-factor interactions from the first fraction that signals the need to change levels 
of some factor(s) or allows elimination of some inert factor(s). Thus, the investiga- 
tor can move on to the next stage of experiments. The resources previously needed 
for the second set of 16 runs can be more wisely expended on the new experiments. 

Randomization with fractional factorials is achieved by performing the actual 
runs of the treatment combinations in a random order if the treatments must be 
tested in sequence. If all treatments are tested on a set of physical experimental 
units they are randomly assigned to the units. 


12.3 Design Resolution Related to Aliases 


Fractional factorial designs are grouped into classes according to existing alias rela- 
tionships in the design. The groups are identified by their resolution. The most 
common designs are those of Resolution III, IV, and V. 


solution III: A design in which no main effect is confounded with any other main 
effect. ae main effects are confounded with two-factor interactions and two-factor 
interactions are confounded with other two-factor interactions. 


Resolution IV: A design in which no main effect is confounded with any other 
main oes or two-factor interaction, but two-factor interactions are confounded 
with one another. 


Resolution V: A design in which no main effect or two-factor interaction is con- 
founded with any other main effect or two-factor interaction, but two-factor inter- 
actions are confounded with three-factor interactions. 


The resolution of a design is determined by the smallest number of characters 
appearing in the design generator. The 2°~! design generated from the ABC con- 
trast with three characters was a design of Resolution III, because main effects were 
not confounded with each other but were confounded with two-factor interactions. 
The 2°-! design generated from the ABC DE contrast with five characters was a 
design of Resolution V because there was no confounding among the main effects 
or two-factor interactions, but the two-factor interactions were confounded with 
three-factor interactions. The 24~! design generated from the ABCD interaction 
with four characters is a design of Resolution IV. 

In general, any fractional factorial design of Resolution R has no q-factor 
interaction confounded with any effect consisting of less than R — q factors. The 
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notation used to identify a design along with its resolution contains a Roman sub- 
script indicating the design resolution. Thus, the notation 2°! identifies a 2°~! 
fractional factorial with Resolution V. 


12.4 Analysis of Half Replicate 2”' Designs 


The analysis of replicated factories! experiments was discussed and illustrated in 
ee ee = 7, and - Th, replicate esesieris provided an estimate of experimental - 
ice from | al f variance. However, with fractional replication of 

mental error is available to evaluate the 


ctor effe . A strategy commonly used to esti- 
mate Senet error aoe fractional replications of 2” experiments is illustrated 
with a 2°! fractional factorial experiment in Example 12.2. 


Example 12.2 Polymer Coatings for Aluminum Cases 


Research Objective: A company had received a contract to manufacture alu- 
minum cases with a plastic polymer coating on the case for transceivers. A 
research team was given the responsibility to develop a process to adhere the 
polymer coating to the aluminum. The team identified five factors in the 
process that they thought had any potential to affect adhesion of the polymer 
to the aluminum. They were the type of aluminum alloy (A), the type of sol- 
vent used to clean the aluminum (5S), the molecular structure of the coating 
polymer (M), the percent catalyst used in the adhesion process (C), and the 
curing temperature for the process (7). 


Treatment Design: A 2° factorial design was chosen so they could take ad- 
vantage of the factorial treatment structure to evaluate important interactions 
along with the main effects. They used two aluminum alloys, two types of 
solvents, two molecular structures for the polymer, 10% or 15% of the cata- 
lyst, and 150° or 175° as a curing temperature. 


Experiment Design: With this preliminary experiment, the team wanted to 
identify any main effects and two-factor interaction effects that were impor- 
tant to the process. They also wanted to reduce the number of experimental 
units and time required for this first experiment. 

They chose to use a 2?,! fractional factorial or a half replicate design that 


required only 16 experimental units. The five-factor interaction ASMCT 
was the defining contrast used to generate the design. Thus, they could run 
the second half of the design later if it became necessary to complete the fac- 
torial. Likewise, the half fraction might provide sufficient information to sig- 
nal the need to alter the levels of some factors or eliminate some with no 
effect. 

The treatment combinations, the coefficients for main effect and two- 
factor interaction contrasts, and the responses from the experimental runs are 
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shown in Table 12.4. The force required to remove the plastic coating from 
the aluminum case was used as the response variable. 


Table 12.4 Force required to remove plastic coating from an aluminum surface 
for treatments in a 2?! fractional factorial experiment 


Design | 

A S M CF AS AM AC AT SMSC ST 38 BF CF Degen Fone 
6 ee EE Nc ee OE a AR A OE OS AS a 39.6 
ns Fe es ae ee ee eee S 43.9 
oo See ee ee ast 38.8 
pa aa PO een ee a REE DS EE OE ee, EEC eee Pen.” m 48.7 
S rs he ac ee ee amt 52.0 
i A On er ee ee smt 55.8 
ae th es ee et ae ee ee sy asm 43.2 
ret ee a a ee ee ee c 59.5 
et A ee Se ee ae eee eee ee act 42.6 
Se a ee = ae sct 44.0 
Eg SE Mee NR ced te, UO eg a eee oe asc 33.8 
aac se ee ee es a ee met 238 
5 wt, SES as es i ee ee ee ee ee — amc 48.1 
ro eds HB, ene aeislges arte 0) oe eres oh) ee Tees 2 ~ smc JL. 
AR ect Cy tbescthc cehesate. ectha: bebe icohetes tasathc SF fi iceerag Go 


asmct 48.7 


"Example computation for effect estimates [see Equation (12.1)]: : 
A = 2(—41.5 + 39.6 — 43.9 + 38.8 — 48.7 + 52.0 — 55.8 + 43.2 — 39.5 + 42.6 — 44.0 + 33.8 — 53.6 
+ 48.1 — 51.3 + 48.7) /16=2(— 31.5)/16= — 3.94 


The + and — coefficients for the first four factors (A, S, M, and C) of a 
25-1 — 24 factorial are written in standard order in Table 12.4. Coefficients for the 
AS MC interaction are used to identify the levels of the fifth factor (7) to be used 
in the treatment combinations. The 16 treatment combinations shown in Table 12.4 
all have a + coefficient in the five-factor interaction, AS MCT’, because they have 
an odd number of effect letters. 

The 2°, fractional factorial design has no main effect or two-factor interaction 


aliased with any other main effect or two-factor interaction (see Table 12.3). The 

main effects are aliased with four-factor interactions (for example, 14 = lsycr), 

and two-factor interactions are aliased with three-factor interactions (for example, 

las =lycr). Assuming the three-factor and four-factor interactions are negligible, * 

estimates of the main effects and two-factor interactions can be used to identify the 

candidate factors for more in-depth investigation in follow-up studies if necessary. 
The effect of a factor in a fractional factorial is estimated as 


_ ae) 


AIS. N 


(12.1) 
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where /4p__ is the contrast of the treatment combinations and JN is the total number 
of observations in the experiment. The 1 degree of freedom sum of squares for an 
effect ina 2”~! fractional factorial is 


SS(AB...) = 


Gan. (12.2) 


The estimates of the main effects and two-factor interaction effects are listed in 
order of increasing value from — 3.94 to + 9.71 in Table 12.5. An example calcu- 
lation for the effect estimate of factor A is given at the bottom of Table 12.4. 
The estimates of the other effects can be verified by using Equation (12.1). 


Table 12.5. Estimates of main effects and two-factor interactions with their nor- 
mal scores sum of squares from the 2°~! fractional factorial data in Table 12.4 


: Normal Sum of 
Effect Estimate Quantile Squares 
A — 3.94 — 1.74 : 62.02 
AS — 3.69 — 1.24 54.39 
S — 0.76 — 0.94 22.38 
SC  — 0.74 | — 0.71 2.18 
AM — — 0.41 — 0.51 0.68 
C — 0.24 — 0.33 0.23 
SM — 0.09 — 0.16 0.03 
AC 0.14 0.00 0.08 
ST 0.16 0.16 0.11 
Be 3 0.44 0.33 O77 
AT 0.74 0.61 2.18 
MC 0.74 0.61 2.18 
MT 1.09 0.94 4.73 
ig 3.61 | 1.24 52.20 
M 9.71 1.74 STT33 


Examine Effects with a Normal Probability Plot 


The observations from the experiment are assumed to be normally distributed with 
a constant variance 0”. The observations represent random normal variation about a 
fixed mean if the changes in the levels of the factors have no real effect on the re- 
sponse. Thus, the estimated factorial effects from these observations are normally 
distributed about a mean of 0 with variance a” when the factors have no effect. The 
estimated effects will plot on a straight line in the normal probability plot (see 
Chapter 4). Estimated effects that do not fit on the straight line cannot be explained 
as random chance variation. 

The corresponding normal quantile computed for each effect is shown in Table 
12.5. Recall that the normal quantile is the expected value for the normally 
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distributed variable with a mean of 0 and variance of 1 found in that position of the 
ordered effects (see Chapter 4). 

The normal probability plot of main effects and two-factor interaction effects is 
shown in Figure 12.1. The main effects that do not plot on the straight line of the 
normal probability plot are the effects of the aluminum alloy (A), molecular struc- 
ture of the polymer (/), and the curing temperature (T7'). The effect estimate for 
the interaction between aluminum alloy and type of solvent (AS) also deviates con- 
siderably from the straight line of the normal probability plot. All other effects ap- 
pear to be representative of random experimental error. 


Interpretations of the Largest Effects 


Graphs of the effects are shown in Figure 12.2. The positive estimate for 7’, 3.61, 
in Figure 12.2a indicates a high level of curing temperature produced stronger 
adhesion. Likewise, the positive estimate for M, 9.71, in Figure 12.2b indicates 
the molecular structure represented by the “high” level produced stronger adhesion. 


Standard normal quantiles 


A: sf © 2 4 6 5 16 
Effects 


Figure 12.1. Normal probability plot of estimated factor effects and interactions 
from the 2°! fractional factorial in Example 12.2 
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(C) (d) 


Figure 12.2 Graphs for the main effects of T°, M, and A and the AS interaction 


The negative estimate for A,—3.94, in Figure 12.2c suggests the “low” level of 
the aluminum alloy factor produced a stronger adhesion. However, the large esti- 
mate for AS interaction, —3.69, may alter the inference about the main effect of A, 
the aluminum alloy. 

An investigation of the interaction between the aluminum alloy (A) and the 
solvent used to clean the aluminum (S’) requires the averages of the four treatment — 
combinations involving A and S. From Table 12.4 the values are 


Treatments A S§ Total Mean 
t+m+cec+met GO 0 1353 ° 458 
a+amt-+ act + amc bo Ae 456 
s+smt-+ sct + smc O 1 d850 . 48:8 
ast + asm + asc + asmct |. frorte4s5 41.1 
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The effect of A when S = 0 is 45.6 — 45.8 = — 0.2, whereas the effect of A 
when S = 1 is 41.1 — 48.8 = — 7.7. Therefore, the difference between the two 
alloys with the second solvent (S = 1) is much greater than that with the first sol- 
vent (S = 0), as shown in Figure 12.2d. 


Some Cautions About the Assumptions 


It is important to remember that because of aliasing the estimated effects judged 
important for the process are really A+ SMCT, M+ ASCT, T+ ASMC, and . 
AS + MCT. Tos the assumption of negligible. Uo Seed and bigher J interac- y 

tions is crucial to the decisions regarding the importance | 
and one two-factor interaction to the bonding HEUER. 

Daniel (1959) pointed out some possible misinterpretations with a normal plot 
of estimated effects in unreplicated factorials. A perceived smooth line of plotted 
effect estimates in one portion of the normal plot thought to indicate no evidence of 
factor effects could lead to an overestimate of error variance. An extremely irregu- 
lar line could lead to an opposite error of judging effects real when in fact they only 
represent random experimental error. The plot is most effective when only a small 
proportion of the factorial effects are important. 


One Method to Estimate Experimental Error Variance 


A test for the significance of these effects requires an estimate of experimental error 
variance. One general strategy to obtain an estimate involves several steps. The first 
step selects those effects that appear to be negligible on the normal probability plot 
and pools their sums of squares to estimate experimental error. Based on Figure 
12.1, only the effects for A, M, T, and AS appear to be significant and all other ef- 
fects appear to be negligible and representative of random experimental variation. 
The analysis of variance for a model containing effects only for A, M, T’, S,and 
AS is shown in Table 12.6. The main effect for S was included in the analysis be- 
cause the interaction effect AS’ is defined on the main effects in the model (Chapter 
6) and should properly be included in the model to obtain correct sums of squares 


Table 12.6 Analysis of variance for the reduced model with A, M, T, and AS ef- 
fects for the 2°~! fractional factorial (Example 12.2) 


Source of _— Degrees of Sum of Mean 

Variation Freedom Squares Square F fro 
Total 15 = 561.41 

A l 62.02 62.02 44.11 0.000 
M ] 379.33 211.90 268.36 0.000 
T l 5220 52.20 37.12 0.000 
S 2 l Dea 2.33 pegs 0.213 
AS ] 54.39 54.39 38.68 0.000 
Error 10 13.14 1.31 
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values. Notice that S' is not significant and could be pooled with the sum of squares 
for error following the general strategy. The pooled sum of squares for the effects 
that appear negligible is SSE = SS(Error) + SSS = 13.14 + 2.33 = 15.47 from 
the analysis of variance in Table 12.6 with 11 degrees of freedom. Thus, an esti- 
mate of experimental error variance is s? = 15.47/11 = 1.41. 


No Substitute for Replication 


A replicated experiment would have provided a legitimate estimate of a”. The 
validity of any significance test is questionable when the variance estimate is based 
on subjective judgments. The method just described to estimate experimental error 
variance from an unreplicated experiment is very subjective. 

A number of more formal procedures have been developed to address the prob- 
lem of variance estimation in unreplicated fractional factorials. However, each of 
them requires a degree of subjective judgment on the part of the investigator. None 
of them are discussed here but references are given for further reading. 

Berk and Picard (1991) reviewed and evaluated some of the methods including 
one of their own. More computational intensive procedures were proposed by Box 
and Meyer (1986) and Zahn (1975a, 1975b). The Zahn methods are extensions 
of the original half normal plot methods of Daniel (1959). Some of the procedures 
(Lenth, 1989; Voss, 1988) proposed simpler methods based on standard analysis of 
variance procedures. 

On the basis of simulation studies, Berk and Picard (1991) concluded most of 
the methods produced nearly identical error rates and most were prone to declare 
null effects to be real. 

The dangers inherent in unreplicated fractional factorials cannot be ignored. 
Only replication can protect against selecting spurious effects as legitimate effects. 
Fractional factorials are legitimate studies if they are recognized as preliminary 
screening studies to prepare the way for more rigorous replicated experiments. 


Standard Errors and Tests of Hypotheses About Effects 


The variance of an effect estimate in a fractional factorial is 


: 4c? 
variance = a5 (12.3) 


where o” is the experimental error variance and N is the number of observations in 
the experiment. The estimated experimental error variance is s? = 1.41 with 11 de- 
grees of freedom from which a standard error estimate-can be calculated for each of 
the effects in Table 12.5. The variance estimate for any effect is 


4s? 4(1.41) 


N 16 = 0:35 


and the estimated standard error of the effect estimates is ,/0.35 = 0.59. The 
significance of each effect can be determined with the Student ¢ test as a ratio of the 
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effect estimate to the standard error or with the F' test as shown in Table 12.6. All 
of the suspected effects are significant according to the F tests in Table 12.6. 


A Residual Plot to Evaluate the Model 


The estimate for experimental error variance was found by assuming all effects 
other than those in the model were negligible. If effects other than those in the 
model are negligible the residual plots for the fitted model should provide evidence 
for correctness of model choice. The residuals were computed for the model includ- 
ing only the A,M,7, S, and AS effects. The plot of residuals versus predicted 
values and the normal probability plot of the residuals are shown in Figure 12.3. 
The residuals all appear to lie on a relatively straight line in the normal probability 
plot, and there does not appear to be any reason to suspect heterogeneous variances 
in the residuals versus the fitted values plot. Also, no major effects appear to be ab- 
sent from the model since there are no evident outliers in the residual plots. 
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Figure 12.3 Residual plots for the model with A,M,7T,S, and AS effects for 
Example 12.2 


The catalyst, factor C’, had no detectable effects and appears to be an inert fac- 
tor. Thus, the design was effectively a single replicate of a complete 2* factorial in 
the other factors A, S, M, and 7. The research team now can consider an in-depth 
study of the other factors and their interactions. 


12.5 The Quarter Fractions of 2” Factorials 


Smaller fractions for initial runs of the 2” factorial may provide sufficient informa- 
tion for critical decisions about the effectiveness of factors, especially when a 
moderately large number of factors are under consideration or individual runs are 
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quite expensive. The quarter replicate of the 2” design is designated as a 
42” = 2"? fractional factorial. 

A quarter replicate of a 2° design is a 2°~? fractional factorial that requires on- 
ly 16 runs of the 64 runs required for the complete 2° design. The quarter replicate 
corresponds to one block of the incomplete block design for a 2° design in four 
blocks of 16 units per block (see Chapter 11). Any of the four blocks could be used 
for the 2°? design. 


Use Two Defining Contrasts for Quarter Fraction Designs 


Two defining contrasts are required to construct the four incomplete blocks, and 
they along with their generalized interaction are confounded with blocks. These 
same defining contrasts can be used to construct the 2°~? design. Suppose ABDE 
and ABC'F are chosen as the design generators. The generalized interaction of the 
design generator is 


ABDE x ABCF = A’B’CDEF =CDEF 


The design generators along with their generalized interaction complete the defin- 
ing relationship for the design as 


i= ADDE—=ABCE = CDE 


The treatment combinations required for the design can be identified in a man- 
ner analogous to that demonstrated for the half replicate design in Table 12.2. The 
+ and — coefficients for the main effects of A, B, C, and D are written in stan- 
dard order for a 2°-? = 24 complete factorial as shown in Table 12.7. 

The aliases of & and F' that contain any of the factor letters A through D are 
used to determine coefficients for & and F' in the treatment combinations. Given 
the design generators ABDE, ABC'F, and their generalized interaction CDEF, 
the aliases of & and F’, respectively, are 


E=ABD=CDF = ARCEF 
and 
FSeARC=CDE = ABDEF 


The equivalencies & = ABD and F' = ABC can be used to determine the 
coefficients for EF and F’. That is, the coefficients for ABD are taken ds the coeffi- 
cients for &, and the coefficients for ABC are taken as the coefficients for F’. The 
treatments required for the design are shown in Table 12.7. 

The defining contrasts for the quarter replicate design have four letters, so the 
resulting design is of Resolution IV, or a 2°;? fractional factorial. No main effect is 
aliased with any other main effect or two-factor interaction, but two-factor interac- 
tions are aliased with one or more other two-factor interactions. The complete set of 
aliases are shown in Table 12.8. | 
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Table 12.7 Construction of a 2%;? fractional factorial design 


aN 


+i titi tit+ttiti+i¢+4 


Table 12.8 
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Design 
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a +s 
— g8 
es - 
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relationship / = ABDE =ABCF. = CDELF 


Effect 


Alias from 
ABCF 
BCF 
ACF 
ABF 
ABCDF 
ABCEF 
ABC 


i oe a i ee 


I+ 1 ++141 


f= ABD F =AbDC Treatnent 


(1) 
ae f 
bef 
ab 
cf 
ace 
bce 
abcf 
de 
adf 
bdf 
abde 
cde f 
acd 
bcd 
abcde f 


Alias relationships for the 2{;? fractional factorial with defining 


CDEF 


ACDEF 
BODE 


Dit 
CEF 
CLF 
CDE 


ABCDEL 


ADEF 
ACE F 
ACDF 
BOUEL 
BE 
DF 
AEF 
ADF 


A 1/2? fraction of a 2” design is designated as a 2"? fractional factorial. In 
general a 1/2? fraction of the 2” factorial will require p design generators with 
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2? — p — | generalized interactions. Each effect will have 2? — 1 aliases. The 1/2? 
or quarter fraction 277” design in Table 12.7 required p = 2 design generators 
resulting in 2* — 2 — 1 = 1 generalized interaction. There were 22 — 1 = 3 aliases 
for each effect. 


Choose Design Generators to Avoid a Bad Alias Structure 


Care must be exercised when selecting design generators to avoid aliasing those ef- 
fects of interest in the study with one another. Thus, it is important to look at the 
alias structure prior to assigning actual factors to the factor letters when setting up 
the experiment. 

Notice that the 27, design in Table 12.7 consists of a complete replication of 


the 2* design in A, B, C, and D. It is also a complete replication of a 24 design in 
any four factors that do not combine to form the defining relationships for the de- 
sign. Any combination of factors other than ABDE, ABCF, and CDEF will 
produce a complete 2* factorial. Upon close inspection the design in Table 12.7 
is seen to consist of two replications of a 2*~! design for experiments, consisting of 
the four factors used in any one of the defining relationships considered separately. 

The 2"? design will contain a complete factorial for any n — p factors that do 
not make up a defining relationship. A fractional factorial can be made up from 
some subset of n — p factors. If it is suspected at the outset that some of the factors 
will have negligible effects, then the original 2"~? fractional factorial can be set up 
so that a full factorial will exist for the factors of most interest or those expected to 
have real effects. 


12.6 Construction of 2”? Designs with Resolution III and IV 


Resolution III Designs with N Experimental Units for N — 1 Factors 


A design of Resolution III can be constructed to investigate n = N — 1 factors us- 
ing N experimental units, where N is a multiple of 4. For example, the half frac- 
tion of a 2° factorial in Table 12.1 is a 237! fractional factorial that has N — 1 = 3 
factors investigated with N = 4 experimental units. Designs of Resolution III have 
no main effects aliased with each other but do have main effects aliased with two- 
factor interactions. 

A 2"? design of Resolution III requires p design generators. Suppose a design 
of Resolution III is desired for N — 1 =7 factors in N = 8 runs. The eight runs 
would be a (1/16) fraction of the 2° = 128 runs required for a full factorial. Thus, 
the design is a 2/7* fractional factorial requiring p = 4 design generators. 

The design construction begins by writing the + and — coefficients for 
n — p main effects in standard order. The + and — coefficients for n — p = 3 
main effects, A, B, and C, in standard order for a complete 2’~4 = 2° factorial are 
shown in Table 12.9. 
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Table 12.9 A 2%,‘ fractional factorial with design generators ABD, ACE, 
BCF, and ABCG | 


a a ep een ee ane 
Unit A-<B? C D=AB E=AC . Fe= BC” G= ABC Treatment 


== + a + =< def 
iaurke Hs be 7 “ “ + afg 
3 rae ee = = = + beg 
ee eS Bee t a = = abd 
5 ee = = re Ae cdg 
6 eee ae + = a ace 
7 <a ae = ei 7 = bcf 
ie ee 43 ag te + abcde fg 


The coefficients required for the remaining factors, D, E, F’, and G, are asso- 
ciated with the coefficients of all the interaction columns of A, B, and C’. Thus, the 
columns that indicate the levels for factors D, E, F, and G are generated from the 
equivalence relationships 


D= Ab Ee AC | Soi Bt and G = ABC 


The p = 4 design generators are then 


Dx AB= ABD 
Ex AC = ACE 
Fx BC=BCF 


and 


G x ABC = ABCG 


The defining relationship is found by multiplying the four generators in all 
possible ways. The complete defining relationship is 


I = ABD = ACE = AFG = BCF = BEG = CDG = DEF = ABCG 
= ABEF = ACDF = ADEG = BCDE = BDFG =CEFG 
= ABCDEFG 


The alias structure of the design for the main effects is found by multiplying 
each of the main effects by the interactions included in the defining relationship. 
The two-factor interactions aliased with the main effects are 
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A=BD=CE=FG 
B=AD=CF=EF 
C=AH=BF=DG 
D=AB=CG=EF 
E=AC = BG.= DF 
P= AG=BC=DE 
GeAP= BE =CD 


The design in Table 12.9 has all available contrasts of the 2”-? = 2? factorial 
associated with the main effects along with their aliases, and the design is referred 
to as a saturated design. The design has 7 degrees of freedom available to estimate 
each of the seven main effects and no degrees of freedom to estimate experimental 
error. Designs of Resolution III can be used in screening experiments with many 
factors to identify clearly dominant factors without major expenses of time and 
other resources. 


The Plackett-Burman Designs 


Resolution III saturated designs with n = N —1 factors requiring N =2"-? 
experimental units can be generated from the basic 2"-? factorial arrangement, as 
shown in the preceding paragraphs. The result is a design with the number of 
experimental units equal to the power of 2—experiments with 4, 8, 16,...,2”? 
units. 

A class of Resolution III designs developed by Plackett and Burman (1946) re- 
quiring a number of experimental units equal to a multiple of 4 have been used 
extensively for screening experiments in industrial research. They provide designs 
for intermediate values of N that are not a power of 2. Designs were provided for 
N < 100, except for 92, by Plackett and Burman (1946). A sample of the most 
practical and easily constructed designs is given in Table 12.10 for 
N = 12,16, 20,24, and 32 experimental runs; the designs can be generated from 
rows of + and — signs. eS 

Each row of generators in Table 12.10 has N — 1 coefficients. These are the 
+ and — coefficients required for the first run of the N — 1 factors. The factor 
coefficients for the second run are generated from the first run by taking the first 
coefficient in the first run and placing it in the last position of the second run and 
shifting the coefficients of the first run one position to the left. Succeeding runs are 
generated in the same manner. Upon completing the cycle of coefficient replace- 
ments for NV — 1 runs a final run is added with — coefficients for all N — 1 fac- 
tors. A design for NV — 1 = 11 factors in N = 12 runs is shown in Table 12.11. 
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Table 12.10 Generators for Plackett-Burman designs with N = 12, 16,20, 24, 


and 32 runs 
Or a 
2: +4 =] et Be ee 
16 aH SS ee a 
20. +4+-— +4 Sob Se s- - + + = 
M+} 4 ee ee ee 
3200 ee SP ee 


emer ——————e—eeeee_eeeee 


Table 12.11 A Plackett-Burman design for N — 1 = 11 factors in N = 12 runs 


a  ————— 


Factor 
Run 1 2 3 4 5 6 Z 8 areas | See 
lon theat, ood SeicsiteAT Heie re = fe Sr Sees 
28 tba) ots Besiioeesare = ea 

330 spooky ase 1 ae = qa ea os 2 a 
Acs ee © Tene els Stes es Ss 
5 ee, Me ee = i 22S oe Se 
GC. ee Pe ae soe 2 
Se ee a ee ee eo 
re ge ee ee ees 
ee ee ee ee as 
CS eS ~~ 
See Se ea Bee oe ee se Se he 
12° - = - — — — — — _ _ _ 


Although this 12-run design could be used to screen 11 factors, it has been 
recommended that the designs be used for 6 less factors than there are runs (Mason, 
Gunst, & Hess, 1989). Upon deleting 6 factors from the design, 5 degrees of free- 
dom remain for an estimate of experimental error variance after the main effects are 
estimated from the design. Of course, this practice still assumes the main effects are 
the dominant effects in the experiment. For example, a 12-run design for 6 factors 
is obtained by using only the first six columns of the design in Table 12.11. There 
are a total of 11 degrees of freedom in the design, of which 6 can be used to esti- 
mate the main effects and 5 used for an estimate of experimental error variance. 


Resolution IV Designs Require 2V Experimental Units for N Factors 


The 2”-? designs of Resolution IV have no main effects aliased with other main ef- 
fects or two-factor interactions, and main effects can be estimated free of confound- 
ing with two-factor interactions. Such was not the case with Resolution III designs. 
A 21,” design with N = n must have at least 2n experimental units or runs. 


12.7 GENICHI TAGUCHI AND QUALITY IMPROVEMENT 413 


A design of Resolution IV with n factors may be generated from a design of 
Resolution III with n — 1 factors by a fold-over technique. A second fraction is 
added to a Resolution III design for n — 1 factors in n runs. All signs in the second 
fraction are reversed from the first fraction. The J column in the first fraction has 
all + coefficients, whereas in the second fraction the J column has all — coeffi- 
cients. This column of coefficients is used for an added factor. The design now has 
n factors with 2n runs. 

The procedure is illustrated in Table 12.12 with the generation of a 2/7! design 
from a 2}7! design. The first fraction of the design is a 2°-! design of Resolution 
III, in which the coefficients for factor C are derived from the coefficients for AB 
interaction. The second fraction is derived from the first with opposite signs for the 
coefficients. The column originally labeled J is used for the fourth factor, D, to 
complete the 2*~' design. The design can be seen to be of Resolution IV since it is 
a half fraction of a 2* design requiring one defining contrast. It is a complete 2° fac- 
torial in A, B, and D with the coefficients of C derived from C = ABD, so that 
the defining contrast is C x ABD = ABCD. The resolution of the design is equal 
to the number of letters in the defining contrast ABC D, or Resolution IV. 


Table 12.12 Generation of a 2‘~! Resolution IV design by folding over a 23! 
Resolution III design 


eee 6, A B C=AB 

+ _ ~ - First fraction is a 27,! design 

= = oe = 

5 = + - 

a i - - 

<= “si + = Second fraction with opposite signs 
v = - > 

= a = + ~ 


12.7 Genichi Taguchi and Quality Improvement 


Finally, we note one special application of fractional factorial designs because of 
recent developments in the application of statistics to manufacturing, part of which 
deals directly with the issue of statistical designs. 

Fractional factorial designs are used extensively in off-line experiments for 
product quality improvement. Off-line investigations integrate engineering design . 
and statistical design principles to improve the quality of products and increase pro- 
ductivity. In particular, the Taguchi methodology (Taguchi, 1986) has had a major 
impact on product and process improvement design in manufacturing. 

Robust parameter design is the one part of the complete Taguchi method- 
ology that involves factorial treatment designs. The design consists of both factors 
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that are controllable in the manufacturing process and those that are not controlla- 
, ble. The controllable factors are called parameters in Taguchi’s terminology; those 
not controllable are called noise factors, or variables. The noise variables or factors 
are those most likely to be sensitive to changes in environmental conditions during 
production and thus transmit variability to the responses of interest in the process. 

Both types of factors, controllable and noise, can be controlled in off-line 
experimentation, and that is an important consideration in the Taguchi methodolo- 
gy. One objective is to determine which combination of controllable factors is least 
sensitive to changes in the noise variables. This is the concept from which the term 
robust parameter design is derived. The best choice of controllable factor levels 
leads to a manufacturing process that results in the desired product and is robust to 
any fluctuations in the uncontrollable noise factors. 

The design concept is illustrated with a hypothetical off-line experiment with 
three controllable factors, A, B, and C, each at two levels, and one noise factor, D, 
with three levels. An orthogonal array for the control factors is crossed with an 
orthogonal array for the noise factors. For our example with four factors, the 2° fac- 
torial for factors A, B, and C would constitute the inner array in Taguchi termi- 
nology, and the 3! for factor D would constitute the outer array. 

The three control factors are configured in Table 12.13 in a } fraction of a 2° 
factorial with 4 runs, or a 2%;/ fractional factorial, in the inner array. Three levels 


of the noise factor, D,in the outer array are used for each run in the 2}7'design, 
resulting in 12 runs for the experiment. 


Table 12.13 Hypothetical Taguchi experiment 


A B C Do dD, Dy Yy S 

= aes + 54 56 52 54 2.0 
+ = “< 69 70 WW 70 1.0 
> = 58 oe 49 54 4.6 
+ + i 58 65 69 64 5.6 


Suppose the manufactured product has a desired response target of 54. The 
mean, 7, and standard deviation, s, of the observations from the runs on the three 
levels of the noise factor, D, for each of the four runs on the control factors are 
shown in Table 12.13. 

Two of the treatments for the control factors,(—, —, +) and(—,+, —), 
meet the desired product response of 54. However, the (— , — , +) treatment, with 
a standard deviation of 2.0, is less sensitive to changes in levels of the noise factor 
(D) than the (—, +,-—) treatment with a standard deviation of 4.6. Thus, the 
product from the (—,—, +) treatment would be considered the more robust 
product since fluctuations on the noise factor (D) transmit less variability to the 
product’s response. | 

The Taguchi analysis method concentrates on the maximization of a signal-to- 
noise ratio (SNR) specific to the process goals. The primary choices for goals are 
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(1) minimize the response, (2) maximize the response, and (3) achieve a specified 
response target value other than minimum or maximum. 

The method proceeds rather simply with an analysis of variance for the SNR 
that determines which control factors affect the SNR. Then the method continues 
with an analysis of variance for the y that ascertains which control factors impact 
the mean response and thus which control factors’ levels can be set to achieve a 
process target response. Myers and Montgomery (1995) illustrate other details of 
the analysis for the interested reader. 

Pignatiello and Ramberg (1991) discuss major advantages and disadvantages 
of the Taguchi approach. Myers and Montgomery (1995), Montgomery (1997), and 
Vining (1998) present more detailed descriptions, examples, and critiques of the 
Taguchi methodology. A major contribution of the Taguchi method was the valida- 
tion of experimentation and statistical design as part of the quality improvement 
process. However, the Taguchi designs have evoked considerable controversy over 
the particular designs used and their implementation. 

Although many standard 2”~? fractional factorial designs are recommended in 
the Taguchi method, many are saturated or nearly saturated Plackett-Burman de- 
signs or fractional designs with three levels to detect response curvature. An 
assumption of no interaction among the control factors often is necessary for suc- 
cessful use of these designs. The crossing of orthogonal design for the control fac- 

tors with that for the noise factors provides ample ability to estimate interactions 
between control and noise factors, but the ability to estimate important interactions 
among the control factors has been sacrificed to do so. Myers and Montgomery 
(1995) suggest several design strategies that could be more economical of experi- 
mental runs and yet provide the means to estimate interactions of interest to the 
investigator. Hunter (1985) presents a detailed discussion of the Taguchi three-level 
fractional designs based on Latin squares. 

Better sequences of design strategies at the onset also have been suggested by 
Hunter (1989) and Pignatiello and Ramberg (1991). As one example, the sequence 
could start with 2”-” fractional factorials to detect interactions with factor levels 
added to the center of the design to detect curvature. Additional suggested refine- 
ments as the process is characterized include the use of complete factorials and re- 
sponse surface designs (see Chapter 13). 

As a final point, randomization is not advocated in the Taguchi approach; it 
can lead to important effects confounded with external variations being inaccount- 
ed for by the design. 


12.8 Concluding Remarks 


_ As we have seen in Chapters 11 and 12, factorials are versatile treatment designs 
that can be adapted to a variety of experimental conditions, including single 
replications, fractional replications, and experiments requiring incomplete block 
designs. 
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Only the rudimentary principles for the construction and analysis of these de- 
signs have been presented here. An extensive list of literature exists on fractional 
factorial designs for experimental work in industry. Extensive discussions on the 
design and analysis of fractional factorials in various fields of application may be 
found in Daniel (1976), Box, Hunter, and Hunter (1978), Diamond (1989), 
Haaland (1989), and Mason, Gunst, and Hess (1989). Some of the cited references 
contain tables of specific designs derived from the basic principles presented in this 
chapter. Diamond (1989) provides an extensive bibliography of published literature 
relating to fractional factorials. 

An abundant number of commercial software products have been developed to 
produce designs and analyses for any experiments that require fractional factorial 
designs. 


Fractional Designs for 3” Factorials 


Fractional replications may be constructed for 3” factorials and other factorial se- 
ries using the same principles as those for the 2” series. One-third and one-ninth 
fractional factorials are possible with the 3” series. Aliases for factorial effects are 
determined on the basis of symbolic products with the defining contrasts and 
squares of defining contrasts. An extensive selection of fractional factorial designs 
for the 3” series is available in Conar and Zelen (1959). Details on the construction 
of fractional factorials in other than the 3” series can be found in Kempthorne 
(1952) and John (1987). Examples of some designs can be found in Cochran and 
Cox (1957), Johnson and Leone (1977), and Montgomery (1997). 


EXERCISES FOR CHAPTER 12 


1. 


A list of designs is shown below. 


1. gp 

li. i 
tings Bie 
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Indicate the following quantities for each design: 


Oe Se 


the fraction of the full design 

the number of generators required 

the number of generalized interactions 

the number of aliases for each effect 

the number of experimental units or runs required 


Construct a 2°! fractional factorial design. Use J = ABC DE as the defining relationship. 
a. What is the resolution of the design? 
b. Show the alias structure for main effects and two-factor interactions. 


10. 
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Construct a2” fractional factorial design. Use ABC D and C DEF as the design generators. 
a. What is the generalized interaction? 

b. What is the resolution of the design? 

c. Show the alias structure for all main effects and two-factor interactions. 


Construct a Resolution IV design that is a 4 fraction of a 2’ factorial. 


If you fold over a 2}, fractional factorial, is the resulting design a 2°;? fractional factorial? 
Explain. 


A 2°? fractional factorial is proposed with two possible generating relations: 
i... Pe ABCD = BCE 
il, Loe ABCDE = ABCD 


a. What fraction of a 2° design will it be? 
b. Which defining relationship is preferred for the design? Explain. 


A 2?,° fractional factorial is designed with defining relationship J = ACE = BC DE. It is known 


that factors A, B, and C do not interact with one another and factors C’, D, and E do not interact 
with one another. | 

a. Which effects can be estimated ignoring three-factor and larger interactions? 

b. Is it possible to have a better 27,7 design for this situation? Explain. 


A researcher conducts an experiment with eight runs using a 2°~? fractional factorial constructed 

with the equivalencies D = AB and E = ABC. Then, the researcher conducts another eight runs 

with the same five factors but reverses the signs used in the original eight runs for each of the 

factors. — : 

a. What is the resolution of the design with the original eight runs? 

b. What is the resolution of the 16-run design? 

c. Suppose it is known at the start that 16 runs are going to be used for the experiment. Can a bet- 
ter design of 16 runs be constructed? If so, construct the design and give its properties. 


A 2%-° fractional factorial design is generated with the equivalencies D = AB, E = AC, and 
F = ABC. What is the resulting design if the factors D and E are dropped from the design? : 


A process under study involved the heat treatment of leaf springs for trucks. The assembled leaf 
spring was put through a high-temperature furnace, transferred to a forming machine where the 
curvature of the spring was formed, and then submersed in an oil quench. The measurement of 
importance was the free height of the spring in the unloaded condition. The target value of the free 
height was 8 inches. Deviations from this value were considered undesirable. An experiment was 
conducted to evaluate the effect of four factors on the deviation of free height from the target value 
in the manufacturing process. The factors considered were (A) furnace temperature; (B) heating 
time; (C’) transfer time, or the length of time to transfer the spring assembly from furnace to curva- 
ture former; and (D) hold-down time, or the time the curvature former is closed on the hot spring. 
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11. 


In addition, the engineers were interested in the two-factor interactions AB, AC, and BC as well 
as the main effects of each factor. A 2*~! fractional factorial was used for the experiment with the 
defining relationship J = ABCD. The eight treatment combinations follow with the signal-to- 
noise ratio (Z) for six springs constructed with each treatment. The signal-to-noise ratio is a mea- 
sure of the deviation of the springs from the target value. 


Run A B & D Zz 
l a a a 51 29.46 
Z — = + a 28.11 
: 3 — — — — 28.00 
4 “rr a Be = 30.59 
5 + ae + i 6 eee e 
6 5 “+ — = 38.68 
- == ae = i = Be ee 
8 = 2 ae 47.70 


Source: J. J. Pignatiello and J. S. Ramberg (1985), Journal of Quality Technology 17, 
198-206. Discussion of article by R. N. Kackar (1985). Off-line quality control, 
parameter design, and the Taguchi method. Journal of Quality Technology 17, 176— 
188. 


What is the resolution of this design? 

Show how the + and — signs were determined for factor D. 

Show the alias structure for this design. 

What must be assumed to obtain estimates of the two-factor interactions AB, 

AC, and BC? 

e. Estimate the main effects and two-factor interactions of interest and their 
standard errors. 

f. Interpret the results. 


SoS S 


The manufacturer of instant soup products wanted to produce dry soup mix packages with 
minimum weight variation among packages. Five factors were identified that might influence 
variation in the filling process. A 2°~' fractional experiment with the defining relationship 7 = — 
ABC'DE was conducted to evaluate the effects of the factors and their interactions. The factors 
and levels were (A) the number of mixer ports through which the vegetable oil was added (1 and 
3), (B) the temperature surrounding the mixer (— =cooled or + = ambient temperature), (C’) the 
mixing time (60 and 80 sec), (D) the batch weight (1500 and 2000 Ib), and (£) the days of delay 
between mixing and packaging (1 and 7). Between 125 and 150 packages of soup mix were 
sampled over an eight-hour production run for each treatment combination. The standard deviation 
for the weight of the packages was computed as a measure of variation in the filling process and 
used as the response variable y. The factor levels and the process standard deviation for each of the 
16 treatment combinations follow. 


12. 
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Run A Bey cS D E y 
] — | — | — 1] ] I 0.78 
2 l —1 | ] 1.10 
3 | ] —] — 1] 4 1.70 
4 ] —| | —] =] 1.28 
5 —] ] —] —] 0.97 
6 — | — | ] —] l 1.47 
- —] | —] | —] 1.85 
8 ] ] l | —] 2.10 
9 —] | ] | ] 0.76 

10 ] l —] ] 0.62 
1] —1 — | ] l — | 1.09 
12 — |] — 1] —] —1 —J Lt 
13 | — | —] — | ] Lao 
14 ] I — J 0.98 
is l —] — |] | —1| 1.36 
16 — 1] ] ] —] — | 1.18 


Source: L. B. Hare (1988), In the soup: A case study to identify contributors 
to filling variability. Journal of Quality Technology 20, 36-43. 


What is the resolution of this design? 

Show how the + and — signs for levels of factor E were determined for the design. 

Show the alias structure for the design. 

What assumptions must be made to estimate main effects and two-factor interactions free of 
any other effects? 

e. Estimate the main effects and two-factor interactions of interest and their standard errors. 

f. Interpret the results. 

g. Construct a quarter fraction design from the present design. 


a o 2 


A 2'-4 fractional factorial, or eight-run, Plackett-Burman design was used to study the effects of 
seven factors on oil consumption in diesel engines. Good oil control consumption is considered to 
be 0.195 grams/horsepower hour (g/hp hr). The seven engine factors each used at two levels were 
(A) top ring fit, (B) top ring twist, (C) intermediate ring face, (D) intermediate ring type, (£) 
piston crown clearance, (F’) oil ring width, and (G) use of liner tabs. The oil consumption in 
grams/horsepower hour (y) is shown for each of the eight runs. 


Run A B ee D E F y 
== = of = Oe 
2 ee ae 
3 a ee ee 
4 a ei ee gree de > Oa 
5 + - = = = es Bees 
6 Oe ee ee. , 
7 ee eS eS 3 Daas 
g - + = a " + je ase 


Source: P. R. Shepler (1975), Fractional factorial plans for diesel engine oil control and for seals. 
Proceedings of the 31st National Conference on Fluid Power, 204-234. — 
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a. What is the resolution of this design? What does the resolution of this design imply with 
regard to the estimation of factorial effects? 

What fraction of a 2° factorial is this design? 

Determine the design generators. 

Show the alias structure for the main effects of the design. 

Estimate the main effects. 

Interpret the results. 


AROS 


The objective was to find the best configuration of the seven factors to reduce oil consump- 
tion. The engineers determined that the best test was obtained with run | (0.204 g/hp hr). They also 
considered a ratio of the average response of the — levels and + levels (largest average/smallest 
average) for each factor and called it the “favor ratio.” If the largest average was from the — lev- 
els, the favor ratio was given a — sign; and if the largest average was from the + levels, the favor 
ratio was given a + sign. The favor ratio signs were the same as that for run | except for the sign 
of factor E, piston crown clearance. To check the results of their previous tests they conducted 
three additional runs. The first run, labeled “check,” included factor levels the same as those for the 
favor ratio except for two factors A and G. The second run, labeled “proof,” included all the same 
factor levels as those for the favor ratio. The third run was a repeat of run 1 with factor & at the + 
level. Their past experience indicated that large piston crown clearance (+ level) almost always 
had been better for oil control. The results of the three additional runs are shown along with the 
favor ratio. 


Run A B C D iy FP G y 
Check ae eae — a as = — 0.435 
Proof -- o =e = a = + 0.662 
Repeat | = 4 = = + + a 1.075 
Favor ratio — — a t- FF e oe 


Were the engineers’ conclusions based on the favor ratio consistent with the estimates of the 
effects from the test runs? Explain. Under what circumstances would the favor ratio be consis- 
tent with main effect estimates? 

h. Were the engineers’ suspicions about factor & correct? Explain. 


Ge 
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12A Appendix: Fractional Factorial Design Plans 


Table 12A.1 Selected 2” ” fractional factorial designs 


Number of — Experimental Design Design 
Factors Units Fraction Resolution Generator* 
3 4 5 III C= AB 
4 8 5 IV D = ABC 
5 8 , II] D2 AB 
i= AG 
16 5 V E = ABCD 
6 16 ri IV Fi = ABC 
f= pC D 
8 t I D=AB 
E=AC 
f= DC 
7 32 ; IV F = ABCD 
G=ABDE 
16 ‘ IV E = ABC 
f= BCD 
G=ACE 
8 77 I = AD 
= AC 
| es Se 
= AC 
8 64 i V G=ADECD 
; H = ABEF 
a2 ; ee ABC 
(== ABD 
= BC DLE 
16 + IV B= BCD 
F=eACD 
G = ABC 
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Table 12A.1 (continued) 


Number of Experimental Design Design 
e Factors Units Fraction Resolution Generator* 
9 16 5 III E = ABC 
F=6CD 
G = ACD 
H = ABD 
J = ABCD 
64 ; IV G = ABCD 
fi = ACELF 
J =CDEF 
32 * IV P= BODE 
G = ACDE 
H=ABDE 
J =ABCE 
10 16 4 II E = ABC 
F=BCD 
G = ACD 
H = ABD 
J = Ago 
=f 
32 #5 IV F = ABCD 
G = ABCE 
Hf =ASDE 
J = ACDE 
K =BCDE 
64 * V G = BCDF 
H = AGDB 
J = ABDE 
K = ABCE 
128 ; V H = ABCG 
J = BCDE 
Ks AC DFT 


” Either the positive or negative half of the design generators may be used to construct 
the fractional design. 


3 Response Surface Designs 


The central topic in this chapter is constructing designs for efficiently estimating re- 
sponse surfaces from factorial treatment designs with quantitative factors. The na- 
ture of linear and quadratic response surfaces is discussed, and designs developed 
specifically for response surface experiments are described. The discussions in- 
clude estimating response surface equations and methods to explore the surfaces. 
Special designs are presented for experiments with factors that are ingredients of 
mixtures. 


13.1 Describe Responses with Equations and Graphs 


The objective of all experiments includes describing the response to treatment fac- 
tors. Throughout this text when treatment factors had quantitative levels we have 
characterized the response (y) to the factor levels (x) with the polynomial regres- 
sion equation. For example, in Chapter 3 a polynomial equation was used to esti- 
mate the relationship between seed production of plants, y, and density of plants in 
the plot, x. The estimated quadratic regression equation was graphed as a curve, 
and we were able to visualize the response of seed production to plant density 
throughout the range of plant Gensities meee, | in the experiment. One of the main | 
advantanes.. of the response curve includes the ability to esc peed 
‘oughout the range of factor forele ieeludea ch he experiment. 


Response Surface Graphs for Two Treatment Factors 


The response equation can be displayed as a surface when experiments investigate 
the effect of two quantitative factors such as the effect of temperature and pressure 
on the rate of a chemical reaction. In Chapter 6 a quadratic polynomial for two 
quantitative factors, salinity of media and number of days, was estimated to charac- 
terize plant response. 
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The quadratic response equation is represented as a solid surface in the three- 
dimensional display of Figure 13.1a. The equation is displayed in Figure 13.1b as a 
contour plot with lines of equal response values similar to contours of equal eleva- 
tion levels shown on topographic maps. 


(a) Quadratic Response (b) Quadratic Response 
Surface Surface Contours 


Figure 13.1 The response equation for two factors displayed as (a) a response 
surface and (b) contours of equal response 


The response surface enables the investigator to visually inspect the response 
over a region of interesting factor levels and to evaluate the sensitivity of the re- 
sponse to the treatment factors. In certain industrial applications the response sur- 
faces are explored to determine the combination of factor levels that provides an 
optimum operating condition, such as the combination of temperature and time to 
maximize the yield of chemical production. In other applications the surfaces are 
explored to find factor-level combinations that economically improve the responses 
over current operating conditions if it is too expensive to attain optimum 
conditions. 

Response surfaces also can be used for analytical studies of fundamental pro- 
cesses. For example, they are used frequently in biological sciences to investigate 
the interplay of factors on the response variable, such as the interaction between 
nitrogen and phosphorus on the growth of plants. 


Polynomial Models Approximate the True Response 


Response surface design and analysis strategy assumes the mean of the response 
variable yz, is a function of the quantitative factor levels represented by the varia- 
bles x1, X2,...,2,. Polynomial models are used as practical approximations to the 
true response function. The true function commonly is unknown, and the poly- 
nomial functions most often provide good approximations in relatively small re- 
gions of the quantitative factor levels. 
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The most common polynomial models used for response surface-analysis are 
the linear, or first-order, model and the quadratic, or second-order, model. The 
first-order model for two factors is 


[by = Bo + Bix + Boxe (13.1) 


and the second-order model is 


[ly = Bo + Bit, + Boxe + Buixy + Box + Brox 22 (13.2) 


The contour plots for first-order models have series of parallel lines repre- 
senting coordinates of the factor levels that produce equal response values. The 
contour plots for quadratic models are more complex with a variety of possible 
contour patterns. One such pattern shown in Figure 13.1b is a symmetrical mound 
shape contour with a maximum response occurring within the central contour. Four 
other patterns are shown in Figure 13.2. Figure 13.2a demonstrates a contour plot 
with a minimum response occurring within the center contour, indicating a sym- 
metrical surface with a depression in the center. Figure 13.2b depicts a rising ridge 
with the maximum occurring outside the experimental region. Figure 13.2c shows a 
stationary ridge in the center of the plot with a decreasing response to the right or 
left of the center line of maximum response. A saddle contour plot or minimax is 
portrayed in Figure 13.2d in which the response can increase or decrease from the 
center of the region, depending on the direction of movement from the center. 


Sequential Experiments for Response Surface Analysis 


Box and Wilson (1951) accelerated the promotion of response surface analysis for 
industrial applications. Their primary theme was the use of sequential experimenta- 
tion with the purpose of determining the optimum operating conditions for an 
industrial process. 

The general approach begins with 2” factorial treatment designs to identify fac- 
tors that influence the process. Subsequent experiments use factor treatment combi- 
nations to locate an area in the factor space that most likely produces optimum 
responses. Ultimately, a 2” factorial arrangement in this region is augmented with 
treatment combinations to characterize the response surface with quadratic 
polynomials. 


Time Scales Can Prevent Effective Sequential Experiments 


In some fields of application the time scale for completion of experiments prohibits 
sequential experimentation. Many biological studies may require months to com- 
plete a single experiment. Often accumulated information from previous biological 
studies enables the investigator to identify the regions of optimum response, 
and experiments can be designed to explore the response surface in those regions. 
The objective of this chapter is to present some of the basic designs and. 
methods of analysis used for identifying optimum conditions in sequential 
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50 


‘Figure 13.2 Contour plots for a (a) minimum surface, (b) rising ridge, (c) sta- 
tionary ridge, and (d) saddle 


experimentation and to present designs for efficiently estimating the response 
surface equations when the region of optimum response is identified. 


13.2 Identify Important Factors with 2” Factorials 


Complete or fractional factorial experiments are used for the initial experiments 
conducted in the study of response surfaces. When the region of optimum response 
is unknown, the 2” factorials or fractions of the 2” factorials are used to identify 
factors that affect the response variable. (These designs were discussed in Chapters 
11 and 12.) 


Estimate Linear Responses to Factors 


The 2” factorials are suitable designs to estimate the mean responses for the /inear, 
or first-order, model in Equation (13.1). The inclusion of two or more observations 
at the middle level of all factors is the usual recommended procedure to estimate 
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experimental error and to provide a means to evaluate the adequacy of the linear 
response surface model. 


An extensive coverage of methods to identify important factors and factor lev- 


els in the region of optimum response conditions can be found in Box and Draper 
(1987) and in Myers and Montgomery (1995). A brief introduction to the identifi- 
cation of important factors is illustrated with an industrial chemistry experiment to 
evaluate the factors that affect the vinylation of methyl glucoside. 


Example 13.1 Vinylation of Methyl Glucoside 


Vinylation of methyl glucoside occurs when it is added to acetylene under 
high pressure and high temperature in the presence of a base to produce 
monovinyl ethers. The monovinyl ether products are useful for various 
industrial synthesis processes. The results of a study on vinylation of methyl 
glucoside by Marvel et al. (1969) are used to illustrate the methods to identify 
and evaluate important factors for response surface characterization. The 
ultimate goal of the project was to determine which conditions produced 
maximum conversion of methyl glucoside to each of several monoviny] 
isomers. 

~Some methods to identify important factors with first-order response sur- 
faces are illustrated with two factors, pressure and temperature. The treatment 
design was a 2? factorial with “temperature” at 130° and 160°C and 
“pressure” at 325 and 475 psi as factors. Four replications were conducted in 
the center of the experimental region at a temperature of 145°C and a pres- 
sure of 400 psi to provide an estimate of experimental error variance and to 
evaluate the adequacy of the linear response model. The treatment combina- 
tions and percent conversion of methyl glucoside are shown in Table 13.1. 


Table 13.1 Vinylation of methyl glucoside in a 2? factorial plus four replications 
at the design center with Temperature and Pressure as factors 


Original Factors Coded Factors 
Temperature Pressure qa x2 %Conversion 
130 325 —] —] 8 
160 an) +1 —] 24 
130 475 —1 +4 16 
160 475 +1 +1 32 : 
145 400 0 0 zi 
145 400 0 0 2 
145 400 0 0 20 
145 400 0 0 24 
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Coded Factor Levels for Convenience 


Coded factor levels provide a uniform framework to investigate factor effects in 
any experimental context since the actual values of factor levels depend on the par- 
ticular factors in the study. Coded levels for the 2” factorial design factors are 
_ (A; - A) 133 

mameceas 5 (13.3) 


where A; is the ith level of factor A, A is the average level for factor A; and D is 
+(A2 — A,). Coded levels of temperature (7’) and pressure (P) in Table 13.1 are 


2 ei asin _ P-400 ais 
os as 15 oe a 75 (13.4) 


Estimates of the Linear Responses 


The estimates of the coefficients for the first-order model in Equation (13.1) a 


m 1 
By =H = 7(8 + 24 + 16 + 32) = 20 


= l l 
Pi gdm ise td tee 
a | | 
bo ae aghast oe) = 


The estimates of the linear coefficients, G,; and (>, are one-half of the factorial 
treatment effect estimates for a 2? factorial (see Chapter 11). 

The variance of the four observations at the design center is s* = 3.33, and an 
estimate of the standard error for the coefficient estimates is 


4 
sg = Ee) = 0.91 


Whether the experimental error variance is adequately estimated with replication 
only at the center of the design factor levels can matter. If the variance of the re- 
sponse in any way depends on the factor level, then replication of the design at the 
high- and low-factor-level combinations is recommended to detect any hetero- 
geneous variability among the treatment combinations. 

The estimates of the regression coefficients indicate that increases in Tempera- 
ture or Pressure will increase the vinylation of methyl glucoside. The estimated 
first-order model equation is 


J = 20 + 82; + 422 
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The temperature and pressure interaction 7’P measures lack of fit to the linear 
model and is represented by the term (122,22 in the quadratic model in Equation 
(13.2). The estimate of the coefficient 3,2 is one-half of the 7’P interaction, or 


l i 


The standard error of Be is 0.91, the same as that for the linear coefficients. The 
estimated interaction component of 0 indicates that Temperature and Pressure are 
acting independently on the conversion. 


Center Design Points to Evaluate Surface Curvature 


Replicate observations at the design center not only provide an estimate of experi- 
mental error, but they also provide a means to measure the degree of curvature in 
the experimental region. Let y, be the mean of the four treatment combinations for 
the 2? factorial and 7, be the mean of the center points. There is some evidence for 

curvature: on the response surface if the average response in the center of the design 
coordinates, ¥,, is larger or smaller than the average response at the extreme levels 
of the factors, y;. The difference (y; — ¥,) is an estimate of 61; + Go2, where (11 
and (25 are the quadratic regression coefficients in Equation (13.2). The observed 
means are ¥, = 20 and y, = 22, with a difference of y; —y, = — 2. The standard 


error of the difference is estimated as 4/3334 + i) = 1.29; the linear response 


appears to adequately describe the surface in this region. 

The contour plot for the estimated linear response equation is shown in Figure 
13.3. The values of the contours ascend as the levels of Temperature and Pressure 
increase. Ascending contours indicate that a combination of Temperature and Pres- 
sure to maximize conversion may exist in a direction perpendicular to the contours. 


Path of Steepest Ascent to an Optimum Response 


Ultimately, the investigator will want to characterize the region of optimum re- 
sponse. To do so requires the investigator to locate the region of factor levels that 
produces optimum conditions. The method of steepest ascent is a procedure de- 
veloped to move the experimental region in the response variable in a direction of 
maximum change toward the optimum. 

Based on the estimated linear equation 7 = 20 + 8x; + 4x0, the path of steep- 
est ascent perpendicular to the contours of equal response moves 4 units in the x 
direction for every 8 units in the x direction. Equivalently, the path has a move- 
ment of 4/8 = 0.5 unit in x2 for every 1 unit movement in 7}. 

The path of steepest ascent is started at the center of the design with 
(x1, x2) = (0,0). The center of the design for values of temperature and pres- 
sure is (I, P) = (145, 400) in Figure 13.3. A change of Az, = 1 unit in the x 
direction is a 15°C change in Temperature and a Ax» = 0.5 unit in the x2 direction 
is a 37.5 psi change in Pressure. 
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182.5 


e Design points on path 
of steepest ascent 


o Original design points 


250 363 475 588 700 


Figure 13.3 Contour plot for the linear response of methyl glucoside vinylation, 
% conversion, to temperature (7') and pressure (P) 


The objective is to move along the path of steepest ascent until a maximum re- 
sponse is observed. The chemist will perform experiments at combinations of Tem- 
perature and Pressure along the path of steepest ascent. Suppose the chemist wants 
to make changes relative to a l-unit change in z,. The levels of Temperature and 
Pressure along the path beginning at (T, P) = (145,400), the design center, with 
l-unit changes in x, and one-half unit changes in x2 are shown in Table 13.2. 


Table 13.2 Path of steepest ascent to search for region of maximum response in 
vinylation of methyl glucoside 


Step a x2 42 P 
0 0 0 145 400.0 
] l 0.5 160 437.5 
ps Zz ro os) 475.0 
3 3 Se 190 Jt 
4 4 


Eventually, as the chemist advances along the path of steepest ascent the in- 
creases in the response become smaller until an actual decrease is observed in the ~ 
response. The decrease in response should indicate that the region of maximum re- 
sponse is in the neighborhood of the current temperature and pressure conditions. 
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At that point in the process an experiment can be designed to estimate a quadratic 
polynomial equation that approximates the response surface. 


13.3 Designs to Estimate Second-Order Response Surfaces 


A new experiment has to be designed to characterize the response surface once the 
region of optimum response is identified. The surface usually is approximated by a 
quadratic equation to characterize any curvature in the surface. 

The 2” factorials or fractions thereof are useful designs to identify the impor- 
tant factors and regions of optimum response. However, in the region of optimum 
response, these designs provide insufficient information to estimate quadratic re- 
sponse equations. At least three levels are required for each factor, and the design 
must have 1 + 2n + n(n — 1)/2 distinct design points to estimate the parameters in 
a quadratic regression model for approximation to the curved surface. 

Desirable properties for experimental designs for response surface estimation 
include the ability to estimate experimental error variance and allow for a test of 
lack of fit to the model. Designs should also efficiently estimate the model coeffi- 
cients and predict responses. 

Several classes of designs with these desirable properties that have been devel- 
oped for second-order response surface approximation are discussed in this section. 


3” Factorials for Quadratic Surface Estimation 


The 3” factorials can be used to estimate the quadratic polynomial equations. How-. 
ever, the number of treatment combinations required by the 3” factorials leads to an 

impractical experiment size. While a 3” design with two factors only requires 9 

treatment combinations, a design with three factors requires 27 and one with four 

factors requires 81 treatment combinations. 


Central Composite Designs Are an Alternative to 3” Factorials 


Box and Wilson (1951) introduced central composite designs requiring fewer 
treatment combinations than 3” factorials to estimate quadratic response surface 
equations. The central composite designs are 2” factorial treatment designs with 2n 
additional treatment combinations called axial points along the coordinate axes of 
the coded factor levels. The coordinates for the axial points on the coded factor 
axes are ( :0,07057..:,0), ©, 2a, 05,0), --, (6/0,0;..., + a). Generally, m 
replications are added to the center of the design at coordinate (0,0,..., 0). 

The central composite designs are used to advantage in sequential experi- 
mentation. The first step of the sequence consists of a series of tests conducted 
along a path of steepest ascent, such as that illustrated in Table 13.2. Eventually, 
the tests lead to a set of factor levels that provide an apparent maximum on the 
path. For example, suppose the responses on the path of steepest ascent are those 
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shown in Figure 13.4, with a maximum response of 36 observed at J’ = 190°C and 
P = 512.5 psi on the path. 


220 


----Q----> 0 


190 


@ Response on original design 


© Response on path of steepest 


ascent 


160 248 


4 New 22 factorial with center 
points 


OQ Axial points to complete 


central composite design 
130 


325 400 475 550 625 


Figure 13.4 Path of steepest ascent and a central composite design 


As a second step in the sequence the chemist can conduct a new 2? factorial 
experiment with several replications at the design center of (Z’, P) = (190, 512.5). 

Suppose the difference (Y; — Y,) computed from the new experiment indicates 
a high degree of curvature on the surface. The third step in the sequential experi- 
ment consists of additional runs of the experiment at the axial points (+ a,0) and 
(0, + a) shown in boxes in Figure 13.4. This last set of treatment combinations on 
the axes, along with the 2? factorial and center points, constitutes a central com- 
posite design as a result of the sequential experimentation. 

One replication of a central composite design consists of Ny; = 2” treatment 
combinations from the 2” factorial, N, = 2n treatment combinations at the axial 
points in the design, and m replications at the center for a total of 
N = N; +N, +m design observations. 

The coordinates on the coded x; and x2 axes for the central composite design 
with two factors are shown in Display 13.1. A graphic display of the coordinate 
locations for the coded factor levels of two- and three-factor central composite de- 
signs are depicted in Figure 13.5. A quadratic equation can be estimated from this 
design because each factor has five levels. In addition, as we shall see in the 
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Display 13.1 Central Composite Design Coordinates 


2? Design Axial Center* 


os L2 1 0 op) 
—j|] -1] —a 0 
eS es | +a 0 
—1] +1 0 —a 
+1 +1 0 ik & 
*m replications 


(a) (b) 


Figure 13.5 Central composite designs for (a) two factors and (b) three factors 


next section, any significant deviations from the quadratic approximation can be 
evaluated. 

The N = 2"+2n+™ experimental units required for the central composite 
design with n factors are fewer than those required by 3” factorials with three or 
more factors. Thus, the central composite designs are more economical in the use 
of experimental resources and provide the ability to estimate quadratic response 
equations. Fractions of the 2” designs with high-order interactions aliased can be 
used as the 2” design base when there are many factors in the study. 


Rotatable Designs to Improve Response Surface Explorations 


Equal precision for all estimates of means is a desirable property in any experi- 
mental setting. However, the precision of the estimated values on the response sur- 
face based on the estimated regression equation will not be constant over the entire 
experimental region. A property of rotatability developed for central composite 
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_ designs requires that the variance of estimated values be constant at points equally 
distant from the center of the design which is coded coordinates (0, 0,..., 0). 

Rotatability of a design becomes important in the exploration of a response 
surface because the precision of the estimated surface does not depend on the orien- 
tation of the design with respect to the true response surface or the direction of the 
search for optimum conditions. The 2” factorials used as first-order designs to 
implement the method of steepest ascent searches for regions of optimal responses 
are rotatable designs. Thus, the orientation of the design does not hinder the method 
of steepest ascent search because some responses are estimated with less precision 
than others. 

The central composite design can be made rotatable by setting the axial point 
values as a=(2")!/4. The value of a for a two-factor design is a= 
(4)'/4 = ,/2 = 1.414, and for a three-factor design a = (8)!/4 = 1.682. If there are 
ry replications of the 2” factorial and r, replications of the axial treatment com- 
binations a more general form for a is a = (r72"/ ra)! 4 If a 2”-? fractional fac- 
torial is used as the basis for the central composite design, then a = (r;2"~?/ Tayl*. 


Example 13.2 Rotatable Design for Vinylation of Methyl Glucoside 


Suppose the path of steepest ascent for the methyl glucoside study in Table 
13.2 provided a maximum response at 7’ = 190°C and P = 512.5 psi and a 
rotatable central composite design is to be constructed with the design center 
at (7, P) = (190,512.5). Also, the relationship between the design coor- 
dinates (x,, x2) and temperature and pressure levels (7’, P) remain as before 
where a change of one unit in x; is 15°C and a change of one unit in x2 is 75 
psi. With a = of 25 the design coordinates and the required temperature and 
pressure settings will be 


Axial Center 2” Design 
Li al ade 0 0 0 oS 
A 0 0 = 2 ae 0 ee 
Pits AQ acelin led 190 mS. 205 175 205 


Piss SI2S 5951255; o24Q6 A Bie $1225 HSS 437.5 S875 3587.5 
*Example calculation, 169° = 190° — \/2(15°) 


Designs for Uniform Precision in the Center of the Design 


As previously stated, the variance of the estimated surface is not constant over the 
entire surface. Box and Hunter (1957) showed that the number of center points in 
the rotatable central composite designs could be chosen to provide a design with 
uniform precision for the estimated surface within one unit of the design center 
coordinates on the coded scale. They reasoned that the investigator is most inter- 
ested in the response surface near the center of the design when a stationary point 
of the surface is located near the center of the design. The stationary point is a point 
of maximum or minimum response or a saddle point as shown in Figure 13.2d. 
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Some central composite rotatable designs with uniform precision are shown in 
Table 13.3. 


Table 13.3. Uniform precision rotatable central composite designs 


Number of Factors _ 3 4 5 5 6 6 
Fraction of 2” 1 1 eae ee ; 1 
Qu 1414 i-ceoee 42.000 = =2.378 2000 2.828 2.378 
Ny 4 8 16 aa 16 64 a2 
Na 4 6 8 10 10 12 12 
m 5 6 7 10 6 15 9 
N 13 20 31 52 Sz 9] a 


The central composite designs require five levels of each factor coded as 
— a, — 1,0,1,a. In some instances the preparation of five levels for some factors 
may be too difficult, expensive, or time-consuming. The face-centered cube design 
is a variation of the central composite design with a = 1 that requires only three 
levels of each factor. Substituting a = | into Display 13.1, the design for two fac- 
tors becomes a 3? factorial. The design is most attractive when the region of interest 
is a cuboidal region produced by the design rather than the spherical region pro- 
duced by the central composite design. 

The design is not rotatable but the absence of this desirable property may be 
offset by the desire to have a cuboidal inference region and also by the savings in 
experimental resources. The face-centered cube design requires fewer runs at the 
center point of the design than does the central composite design to achieve a stable 
variance of estimated values throughout the design region. However, it should be 
remembered that replicate runs are needed at some design point or points to esti- 
mate experimental error variance. A face-centered cube design for three or more 
factors requires fewer treatment combinations than the 3” factorials; thus, it is 
another alternative to the 3” design, requiring fewer experimental units. 


Box—Behnken Designs Another Alternative to 3” Factorials 


A class of three-level designs to estimate second-order response surfaces was pro- 
posed by Box and Behnken (1960). The designs are rotatable, or nearly so, with a 
reduction in the number of experimental units compared to the 3” designs. The de- 
signs are formed by combining 2” designs with incomplete block designs. Details 
for construction can be found in Box and Draper (1987). The coded factor levels 
for the treatment combinations required in a design for three factors are shown in 
Display 13.2. A complete set of treatment combinations for a 2? factorial occurs for 
each pair of factors accompanied by the 0 level of the remaining factors. Several 
replications of the design center (0, 0,..., 0) are included. 
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Display 13.2 Box—Behnken Design Coordinates 
for a Three-Factor Design 


Factor 
Coded Level 


2? factorial 
for A and B 


2? factorial 
for A and C 


2? factorial 
for B and C 


Design center 


These designs are spherical rather than cuboidal since the design points fall on 
the edges of a cube rather than on the corners like those of the face-centered cube 
design. The Box—Behnken design should only be used if one is vot interested in 
predicting responses at the corners of the cuboidal region. 


Incomplete Block Designs to Increase Precision 


Incomplete block designs are useful to reduce experimental error variance when the 
number of treatments is large or when the experimental conditions preclude the 
conduct of complete replications at one time or under the same conditions. 

Box and Hunter (1957) gave the conditions for blocking second-order response 
surface designs so that the block effects do not affect the estimates of the param- 
eters for the response surface equation. They showed that two conditions must be 
satisfied for the blocks to be orthogonal to the parameter estimates of the response 
surface equation. Let nz be the number of treatments in the bth block. The two con- 
ditions necessary are 


1. Each block must be a first-order orthogonal design. For each block the 
following relationship must hold for each pair of design variables x; and z;: 
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No 
ee 1 Aj =0,1,2,...,0 (13.5) 
=a 
2. The fraction of the total sum of squares for each design variable contributed by 
every block must be equal to the fraction of the total observations placed in the 
block. Thus, the following relationship must hold between the design variables and 
the number of observations for every block: 


ajc pian Ses as eenaee (13.6) 


A suggested strategy for blocking the central composite design places the Ny; 
treatments for the 2” design and my, design center points in one block and the Ng 
axial treatments with m, design center points in a second block. This blocking 
arrangement satisfies the first condition, Equation (13.5). 

The central composite rotatable design for two factors arranged into two blocks 
is shown in Display 13.3. The first block is composed of N; = 4 treatment combi- 
nations of the 2? factorial plus m; = 2 design center points, and the second block 
consists of NV, = 4 axial treatment combinations plus m, = 2 design center points. 
The computations required to evaluate the first condition for an orthogonal block 
design are the sums of crossproducts between x and x2 in each block. It is easy to 
verify that /) 2; x2 = 0 in both blocks. 


Display 13.3 Central Composite Rotatable Design for 
Two Factors in Two Incomplete Blocks 


Factor B 
Coded Level Ly XL 


Block 1 


Block 2 
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For the complete design 


The number of treatment observations in blocks | and 2 are n; = n2 = 6, and 
the total number of observations is N = 12 with a ratio n;/N = 6/12 = 1/2. The 
second condition, Equation (13.6), requires that the ratio of the sums of squares of 
x, and x in every block to that for the entire experiment be equal to n;/N. For 
both block 1 and block 2 the sum of squares ratio is 4/8 = 1/2, which is equivalent 
to the ratio for n;/N; the design is orthogonal. 

For the second condition to be satisfied the following relationship must hold: 


1+ De 
a=n a2 | (13.7) 
1 + pf 


where py = m,/N, and ps = m;/ Ny. For the design to satisfy the two conditions 
and be rotatable a = (2"r;/ r,)'/4. It is not always possible to find a design that 
exactly satisfies Equation (13.7) with a = (2"r;/ r,)'/4, but in practice values of 
the design observation numbers can be determined to provide designs with near 
orthogonal blocking and rotatability. Box and Draper (1987) provide relative pro- 
portions of rs and rq required for rotatability and orthogonal blocking when 
Pa = Pf: 

For the design in Display 13.3 the fraction of design center observations in 
each block is p, = py = 1/2 anda = ,/2. 

Evaluating the condition for rotatability and orthogonality in Equation (13.7) 


we have 
1 ee 1+1/2 
be py 1+1/2 


and a = ,/2 as required for rotatability. 

The central composite rotatable designs listed in Table 13.3 can be placed in 
useful incomplete block designs for near rotatable and orthogonal central composite 
designs. The 2” factorial or 2"-? fractional factorial is placed in one or more 
incomplete blocks and the axial treatment combinations are placed in one separate 
block. For the designs listed in Table 13.3 the number of blocks for the 2” factorial 
or fractional factorial, and a suggested number of design center points in each block 
are shown in Table 13.4. 
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Table 13.4 Incomplete block designs for near rotatable and orthogonal central 
composite designs 


Number of Factors 2 3 “ 5 5 6 6 
Fraction of 2” 1 1 l ] 5 1 ; 
Ny 4 8 16 32 16 64 32 
™ f 2 2 2 4 Za 2 2 
Number of Blocks* l 2 2 4A l 8 2 
a 414 4682 2a 2.378 = 2.000 — 2.828 - 2.378 
Na 4 6 8 10 10 12 V2 
Ma 2 2 1 1 4 l 4 


*See Appendix 11A for defining contrasts to block the 2” factorial or fractional factorial design 
portion. 


Reducing the Number of Design Points 


The expense, difficulty, or time consumption with certain types of experiments may 
necessitate reducing the experiment size. The amount of reduction is limited by the 
statistical model to estimate the response surface. The second-order response sur- 
face equation for n factors has a constant term, n linear terms, n quadratic terms, 
and n(n — 1)/2 interaction terms for a total of (n + 1)(n + 2)/2 terms. Thus, the 
minimum number of points a design could have to estimate the second-order re- 
sponse surface is (n + 1)(n + 2)/2. 

Designs have been developed to have as close to the minimum number of 
points as possible to estimate the second-order response surface. Tables of these de- 
signs or methods to construct them can be found in Box and Draper (1974), 
Roquemore (1976), Notz (1982), Draper (1985), Draper and Lin (1990), and Myers 
and Montgomery (1995). | 

Most of the designs are based on 2”~? fractional factorials augmented with de- 
sign points to estimate second-order response surface models. In most cases the de- 
signs are saturated with few or no replicated design points. An independent 
estimate of experimental error is required to test the efficacy of the response surface 
model, unless the design is replicated. In addition, the saturated designs do not al- 
low a test for lack of fit of the hypothesized second-order response surface model. 


An Evaluation of Response Surface Designs 


Myers et al. (1992) used the prediction variance of the second-order response sur- 
face equation to evaluate many of the popular second-order response surface 
designs. A design was considered superior if the variance of predicted values was 
smaller than that of other designs. 

The central composite designs were found superior in general over spherical 
regions covered by the design points (see Figure 13.5). When designs were re- 
stricted to the cuboidal regions (a = | in Figure 13.5), the resulting face-centered 
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cube design formed by the central composite design was found generally superior 
to the Box—Behnken design in the cuboidal region. 

Among the saturated designs, the designs by Roquemore (1976), Notz (1982), 
and Box and Draper (1974) were found quite efficient relative to others that were 
evaluated. 

Myers and Montgomery (1995) presented design efficiencies for estimating 
model coefficients and for prediction variances in a spherical region. Their general 
conclusions were that central composite and Box—Behnken designs were quite effi- 
cient as were some of the saturated designs from Roquemore (1976). 


13.4 Quadratic Response Surface Estimation 


When the supposed region of optimum response has been identified by the method 
of steepest ascent or other methods of experimentation it is often necessary to char- 
acterize the response surface in that region of the factors. Using the designs de- 
scribed in the previous section, experiments can be conducted to obtain data for 
estimating a quadratic approximation to the response surface. 

The estimated response equation will enable the researcher to locate a station- 
ary response point that could be a maximum, a minimum, or a saddle point on the 
surface. An examination of the contour plot will indicate how sensitive the re- 
sponse variable is to each of the factors and to what degree the factors interplay as 
they affect the response variable. 


Example 13.3 Tool Life Response to Lathe Velocity and Cutting Depth 


A new cutting tool available from a vendor was going to be used by a com- 
pany. The vendor claimed the new model tool would reduce production costs 
because it would last longer than the old model; thus, tool replacement cost 
would be reduced. The life of a metal cutting tool is dependent on several 
operating conditions, including the speed of the lathe and the depth of the cut 
made by the tool. 

The plant engineer had determined from previous studies that maximum 
tool life was achieved for the current tool with a lathe velocity setting of 400 
and a cutting depth setting of 0.075. The engineer wanted to determine the 
optimum settings required for the new tool. A central composite design was 
used for an experiment to characterize the life of the new tool under varying 
lathe speeds and cutting depths within the region of current optimum operat- 
ing conditions for maximum tool life. The data from the experiment are 
shown in Table 13.5. 
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Table 13.5 Observed tool life from a factorial experiment with lathe speed and 
cutting depth treatment factors in a central composite design 


Original Factors Coded Factors 
Lathe Speed Cutting Depth Ly x9 Tool Life 
600 0.100 ee + J 154 
600 0.050 +4 —] 132 
200 0.100 ae. fF 4 166 
200 0.050 — | —] 83 
683 0.075 /2 0 156 
1a 0.075 — /2 0 144 
400 0.110 0 f2 166 
400 0.040 0 -/2 9] 
400 0.075 0 0 167 
400 0.075 0 0 175 
400 0.075 0 0 170 
400 0.075 0 0 176 
400 0.075 0 0 156 
400 0.075 0 0 170 


The Estimated Response Surface Equation 


The second-order response surface model of Equation (13.2) is fit to the data by 
least squares regression procedures. The equation can be estimated by any com- 
puter program for regression analysis. A brief account of least squares estimation 
for regression models is given in Appendix 13A.1. A detailed presentation of re- 
gression analysis can be found in Rawlings (1988). The estimated second-order re- 
sponse surface equation for tool life from the data in Table 13.5 is 


G = 169 + 6.7472 + 26.385x2 — 10.875x7 — 21.6252? — 15.2502, 22 


Sum of Squares Partitions for the Regression Analysis 


The sum of squares partitions in the analysis of variance for the regression model 
are shown in Table 13.6. The sum of squares for the full second-order model is 


SS R(x, 22, £7, £5, 2122) = 10,946 


The regression sum of squares is partitioned into reductions for linear and 
quadratic components of the model using the principle of reduced model and full 
model sum of squares partitions. 

The partition for the linear components of the model, x; and x», or 


SSR(a, £2) — 5,933 
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Table 13.6 Analysis of variance for quadratic response surface model 


Source of Degrees of Sum of Mean 

Variation Freedom Squares Square 

Total 13 £1,317 

Regression D 10,946 2,189.2 
Linear (21, £2) 2 5,933 2,966.5 
Quadratic (x?, 72, 7122) 3 5,013 1,671.0 

Error 8 a7] 46.4 
Lack of Fit 3 111 37.0 
Pure Error 5 260 52.0 


is the regression sum of squares for the reduced first-order model y= 
Bo + F121 + Bota + e. The partition for the quadratic components is the difference 
between the regression sum of squares for the full model and the reduced model, or 


SSR(a?, x2, 2122 | 21,22) = 10,946 — 5,933 = 5,013 


The sum of squares for error, SSE = 371, is partitioned into two parts. The sum of 
squares for pure experimental error, SSE(pure error) = 260, with 5 degrees of 
freedom is computed from the six replicate observations at the center of the design 
with factor coordinates (V, D) = (400, 0.075). The remaining 3 degrees of free- 
dom sum of squares for error, S'S E(Lack of fit) = 111, can be attributed to error in 
specification of the response surface model, referred to as lack of fit, or it can be 
attributed to experimental error. Since the six center points of the design provide an 
estimate of pure experimental error, the sum of squares designated as lack of fit can 
be used to test the significance of lack of fit to the quadratic model. 


Tests of Hypotheses About the Second-Order Model 


The hypotheses of interest in the analysis are 


e Significance of the complete second-order model: 


Ho: Bi = Bo = Bu = Boe = Biz = 0 
2,189.2 


F 
52 


= 42.1 


Reject Ho since Fo > F'os,5,5 = 5.05 
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e Significance of linear components for the model: 
Ao: 2; = Bz = 0 


_ 2,966.5 
Theres 


0 = 3570 Reject Hp since Fo > F'05,2,5 = 5.79 


e Significance of quadratic deviations from the linear model: 


Ho: Bu = Po = Biz = 0 
1,671 
nines? 


Fo a P| Reject Ho since Fo > F’05,3,5 me SAY 


e Significance of lack of fit to the quadratic model: 


Sie 
Pas iT 0.71 Do not reject Ho since Fo < F'o5 3,5 = 5.41 


The complete quadratic regression model is significant, and the lack of fit to the 
quadratic model is not significant; thus, we can conclude the second-order model is 
an adequate approximation to the true response surface. A contour plot of the qua- 
dratic response surface model depicted in Figure 13.6 shows a maximum surface 
with a maximum tool life in the middle of the center contour. 


0.75 
0.00 
—-0.74 


—1.50 


Figure 13.6 Response surface contour plot for the tool life experiment response 
equation ¥ = 169 + 6.7472, + 26.385x2 — 10.875x? — 21.62522 — 15.2502 2% 
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The coordinates of the contour plot are displayed for the coded values of the 
two factors. The orientation of the contours indicates some interaction between 
lathe speed x, and cutting depth x2. For example, a constant cutting tool life of 150 
can be maintained for faster lathe speeds, increasing x, by decreasing the cutting 
depth, decreasing 72. 

The contours also indicate the relative sensitivity of tool life to the coded factor 
levels x; and x2. The tool life contours increase more rapidly toward the maximum 
on the coded cutting depth axis x than they do on the coded lathe speed axis 7. 


13.5 Response Surface Exploration 


The significant quadratic equation and the contour plot of the equation have given 
us a general picture of the relationship between tool life and the two design factors, 
lathe speed and cutting depth. 

Estimates of the coordinates for the stationary point on the surface and an esti- 
mate of the response at the stationary point provide a more specific characterization 
of the response surface. Sometimes, it is useful to know the direction and amount of 
change to make in one or more of the design factor levels to achieve the maximum 
change in response. 

A more specific characterization of the sensitivity of response to the design 
factors can be achieved with the canonical form of the response equation. The loca- 
tion of coordinates for the stationary point and derivation of the canonical form of 
the response equation requires some knowledge of calculus and matrix algebra. 
However, the results of the computations are understandable when they are dis- 
played in graphic form on the contour plot of Figure 13.6. 


Locating Coordinates for the Stationary Point of the Response Surface 


The x; and x2 coordinates for the stationary point are obtained from the partial 
derivatives of the estimated response function with respect to x; and x». The esti- 
mated response for tool life is 


G = 169 + 6.7472 + 26.3852 — 10.8752? — 21.62523 — 15.2502, 22 (13.8) 


The partial derivatives are set equal to 0: 


OU’ a 
oo 2, ou 
Oz] Ox 
to produce the equations 
2( — 10.875)a, + (— 15.250)xz2 = — 6.747 
(— 15.250)a, + 2( — 21.625)a. = — 26.385 
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The solutions of the equations for x, and x2 are 7;, = — 0.156 and %, = 0.665. 
These values are the coordinates for the maximum response on the surface at the 
stationary point indicated on Figure 13.6. 

The estimated response at the stationary point is found by substituting 
Lis = — 0.156 and £2, = 0.665 into Equation (13.8); the estimated stationary point 
response is 


Y, = 169 + 6.747( — 0.156) + 26.385(0.665) — 10.875( — 0.156)? 


— 21.625(0.665)? — 15.250( — 0.156)(0.665) = 177.25 


Given x; = (V — 400)/200 and x, = (D — 0.075) /0.025, the values of lathe speed 
(V ) and cutting depth (D) at the stationary point are 


V = — 0.156(200) + 400 = 368.8 


and 


D = 0.665(0.025) + 0.075 = 0.092 


_ The general solution for a stationary point with any number of 2; variables in the 
response equation is given in Appendix 13A.2. 


The Canonical Analysis to Simplify the Quadratic Equation 


The canonical form of a quadratic equation is an effective aid to visualize the sur- 
face and to determine the relative sensitivity of the response variable to each of the 
factors. It is difficult to visualize the surface by examining the estimated coeffi- 
cients for the normal form of the quadratic response equation. Likewise, it is diffi- 
cult to determine the changes in factor levels necessary to produce a specified 
change in the response. : 

The canonical analysis rotates the axes of the x; variables to a new coordinate 
system, and the center of the new coordinate system is placed at the stationary re- 
sponse point of the surface. The canonical form of the equation for two variables is 


G=9,+ AMZ + Zs (13.9) 


where Z; and Z, are the rotated axes’ variables. Notice only the quadratic terms of 
the canonical variables Z, and Z, are included in the canonical form of the re- 
sponse equation. An outline of the computations required to obtain the canonical 
form for the tool life response equation is given in Appendix 13A.3. 

The canonical form for the tool life response equation is 


9 = 177.25 — 25.5827 — 6.9223 (13.10) 


where the center of the new coordinate system is located at 1; = — 0.156 and 
2 = 0.665 in the original coordinate system shown in Figure 13.6. The 
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relationship between the two coordinate systems was determined (Appendix 13A.3) 
to be 


Z; = 0.46032, + 0.8877x2 — 0.5185 
Zo = 0.88772, — 0.4603 r2 + 0.4446 (13.11) 


Notice the Z; and Z, canonical axes are oriented with the contours of the surface. 
The sizes and signs of the \; indicate the type of quadratic response surface that has 
been estimated. 

The ; coefficients for the tool life surface are A; = —25.58 and 
A> = — 6.92. Examination of the surface in Figure 13.6 reveals that any movement 
away from the center of the Z,, Z. coordinate system results in a response de- 
crease. Thus, when all \; coefficients are negative the surface is a maximum sur- 
face such as that for the tool life surface in Figure 13.6. 

If the A; coefficients are positive, then any movement from the center of the 
Z,, Z. coordinate system results in a response increase and the surface is a mini- 
mum surface as shown in Figure 13.2a. If one coefficient is positive and the other 
negative, say \, > 0 and A2 < 0, then movement away from (0,0) along the 2; 
axis results in an increased response and movement along the Z» axis results in a 
decrease. Thus, the surface is a saddle or minimax at the stationary point as shown 
in Figure 13.2d. If one of the A; = 0, the surface is a stationary ridge (Figure 13.2c) 
because the response will not change along the Z; axis. 

The lengths of the principal axes of the ellipses formed by the contours are 
proportional to | A; |~1/2. For the tool life surface | — 25.58 |~!/? = 0.20 and 
| — 6.92 | 1/2 = 0.38, and the fitted surface is attenuated along the Z axis as seen 
in Figure 13.6. 

Suppose, for illustration, that lathe speed and cutting depth for maximum tool 
life at coordinates x; = — 0.156 and zx» = 0.665 were impractical. The least 
change in tool life as lathe speed and cutting depth change is exhibited on the sur- 
face along the Z axis direction when Z, = 0. The x; and z2 coordinates along the 
Z> axis when Z, = 0 can be obtained from the first equation in Equations (13.11). 
The least loss in tool life can be found on settings corresponding to values of 2x; 
and x that satisfy 0.46032, + 0.8877z2 — 0.5185 = 0. 

The coefficients of the x; in Equations (13.11) can provide information about 
the relationships of lathe speed and cutting depth to tool life. Consider the 
coefficients for the second equation relating Z, to xz, and 2, Z2= 
0.8877x, — 0.46032, + 0.4446. The pair of coefficients (0.8877, — 0.4603) indi- 
cate a compensation between lathe speed and cutting depth on tool life. An increase 
in lathe speed, to some extent, can be compensated for by a decrease in cutting 
depth along the elongated Z, axis. 

The estimated response equation in its original form or in the canonical form is 
only valid for the region of factor levels included in the experiment. Any attempt to 
estimate the tool life outside of the limits bounded by lathe speeds of 117 and 683 
and cutting depths of 0.04 and 0.11 could be quite misleading. An entirely different 
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response model may be necessary to describe tool life outside of the region used by 
the current study. 


13.6 Designs for Mixtures of Ingredients 


Some treatment designs involve two or more factors that are ingredients of a mix- 
ture in which the percentages of the ingredients must sum to 100% of the mixture. 
Therefore, the levels of one factor are not independent of other factor levels. 

Many food products, construction materials, and other commercial products 
are formed from mixtures of two or more ingredients in a recipe. Some examples 
are 


e fabrics with a blend of cotton and polyester fiber 


fruit juice blends of orange, pineapple, and apple juices with water 
e concrete formed from water, aggregate, and cement 


fertilizer formulations of nitrogen, phosphorus, and potassium 


This section includes a brief introduction to selecting designs and estimating 
response surface equations for mixture experiments. Cornell (1990) provides an in- 
depth coverage of design and analysis of mixture experiments. 


Factor Levels Are Proportions of Ingredients 


Variation in the proportions of the ingredients in mixtures can affect the properties 
of the end product. Investigations with mixture experiments concentrate on the rela- 
tionship of the measured response variable to the relative proportions of the sepa- 
rate ingredients present in the product rather than on the total amounts of the 
factors. 

If x1, Z2,..., 2% are the variables representing the proportions of the k ingredi- 
ents or components of the mixture, the values of the x; are constrained such that 


O= 2. = 1 Ape te (13.12) 


and the proportions of the k ingredients in the mixture sum to unity, or 


M4 


U;= 2, +%o+---+ ay = 1 (13.13) 
iz} 

If the proportion of one ingredient is x; = 1, then the other ingredients are ab- 
sent from the mixture and the product is a pure or single-component mixture. For 
example, a two-component mixture experiment with cotton and polyester fabric 
blends represented by the proportions x; and x2 may have pure cotton fabric, in 
which case 2; = 1 and rz, =0, or a pure polyester fabric, where xz, = 0 and 
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xo = 1. The allowable values of x; and x2 for a two-component mixture design are 
coordinate values along the line x; + x2 = 1 (shown in Figure 13.7). 


i 


X= 1 


(0, 0) 


xX, =1 


nn ee a EEE EEE ESE 


Figure 13.7 Factor space for a two-component mixture, 7) + Z2 = | 


Depict the Factor Space with a Simplex Coordinate System 


The coordinate values for a three-component mixture design are the coordinate 
values found on the plane defined by x; + +2 + 3 = | in Figure 13.8a. The geo- 
metric description of the factor space for k components is that of a simplex in 
(k — 1) dimensions. The two-dimensional simplex coordinate system for the three- 
component mixture design is shown in Figure 13.8b as an equilateral triangle. 


x=1 (a) (b) 
(1, 0, 0) 


x,;=1 
(1, 0, 0) 
Xo + X3=1 
Ka (0, 1, 0) (0, 0, 1) 
(0, 0, 1) Xo = 1 X3 = 1 
Xo = 1 
(0, 1, 0) 


Figure 13.8 Factor space for (a) a three-component mixture, 7; + r2 + 73 = | 
and (b) a three-component simplex coordinate system 
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The vertices of the triangle represent single-component mixtures with one 
x; = 1 and all others equal to 0. The sides of the triangle represent design coor- 
dinates for two-component mixtures with one x; = 0. Design coordinates in the tri- 
angle interior represent three-component mixtures with x, >0, z2 > 0, and 
x3 > 0. Any combination of component proportions for a mixture experiment must 
be on the boundaries or inside the triangle of coordinates in Figure 13.8b. 

The axes of the x; variables in the three-variable simplex coordinate system are 
shown in Figure 13.9. The axis for component 7 is the line from the base point 
x; = 0 and xz; = 1/(k — 1) for all other components 7 #7 to the vertex, where 
xz; = 1 and z;=0 for 7 #72. For example, with the three-component design in 
Figure 13.9, the x, axis extends from the base coordinate (0, 4,5) to the vertex 
coordinate (1, 0, 0). 


x, =1 
(1, 0, 0) 


(0, 1, 0) (0, 5,5) (0, 0, 1) 
Xo = 1 X3=1 


Figure 13.9 Simplex coordinate axes for a three-component mixture with design 
coordinates for a {3,2} simplex-lattice design 


Treatment Designs for Mixtures 


Simplex-Lattice Designs 


The array made up of a uniform distribution of design coordinates on the sim- 
plex coordinate system is known as a /attice; see Figure 13.9. The simplex-lattice 
designs consist of a lattice of design coordinates constructed to enable the estima- 
tion of polynomial response surface equations. 

The designation {k,m} is used for a simplex-lattice design with k components 
to estimate a polynomial response surface equation of degree m. For example, a 
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{3,2} simplex-lattice design has three components in the mixture design to 
estimate a quadratic response surface equation. 

The proportions of each component included in a {k,m} simplex-lattice 
design are 


py ey (13.14) 


The design consists of all possible combinations of those levels of the z;, where 
»i x; = 1 for any combination of proportions. 

The combinations of mixture proportions shown at each e in Figure 13.9 are 
the coordinate values for a {3,2} simplex-lattice. The proportions for each x; with 
m= 2 mea, = and 1. It can be seen in Figure 13.9 that x; = 1 for each 
design point. 

The {k,2} simplex-lattice design to estimate quadratic response surface equa- 
tions only has mixtures on the boundaries of the coordinate system with one or 
more of the components absent from the mixture. The general {k,m} simplex- 
lattice will consist of single-component mixtures, two-component mixtures, and so 
forth up to mixtures consisting of at most m components. If m = k, there will be 
one mixture at the centroid of the coordinate system in the experiment that contains 
all mixture components. For example, the {3,3} simplex-lattice would include the 
mixture with component proportions (4,4, 4) as well as the single-component 
mixtures and the two-component mixtures with proportions 7 and 3 for the two 
components. 


1 
re 99 


Simplex-Centroid Designs 


The simplex-centroid design is a design on the simplex coordinate system that 
consists of mixtures containing 1,2,3,..., or k components, each in equal 
proportions. Consequently, there are k& single-component mixtures, all possible 
two-component mixtures with proportion for each component, all possible three- 
component mixtures with proportion } for each component, and so forth up to one 


k-component mixture with proportion for each component. The mixtures for the 
simplex-centroid design are contrasted with the mixtures for the {3,2} and {3,3} 
simplex-lattice designs in Table 13.7. 


Augmented Simplex-Centroid Designs 


The mixture combinations for the simplex-lattice and simplex-centroid designs lie 
on the edges of the simplex factor space with the exception of one centroid point 
that contains all mixture components. More complete mixtures are possible by aug- 
menting the simplex-centroid design with mixtures on the axes of the simplex fac- 
tor space. 

The design points are positioned on each axis equidistant from the centroid 
toward the vertices. A k-component design will have k additional design points 
with coordinates 
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He 2 aie TLE 
(k +1) 1 1 
2k 2k 2k 
1 (+h |. 1 
2k 2k 2k 
1 1 ee 
2k 2k 2k 


The addition of the axial points will provide a better distribution of information 
throughout the experimental region. The three additional design points required 
by augmenting the simplex-centroid design for three components are (G, E 5)» 
(4, 2,4), and (2, 4, 2). The complete design is depicted in Figure 13.10. 


Table 13.7 Simplex-lattice designs and a simplex-centroid design for a three- 
component mixture 


{3,2} Lattice {3,3} Lattice Centroid 
ZL X2 L3 L1 L2 L3 Wo Z2 L3 
l 0 0 l 0 0 1 0 0 
0 l 0 0 1 0 0 1 0 
0 0 1 0 0 ] 0 0 ] 
1 1 2 I 1 1 
5 7 0 3 3 0 5 5 0 
1 1 2 ! 1 1 
2 0 2 3 0 3 2 0 2 
| 1 1 2 ! 1 1 
0 2 2 0 3 3 0 2 2 
1 2 1 I 
3 3 0 ee 3 
1 2 
3 0 3 
l 2 
0 3 3 
1 1 1 
iG 3 3 
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xX, =1 
(1, 0, 0) 


(0, 1, 0) (0, 5.5) (0, 0, 1) 
Xo = 1 X3=1 


Figure 13.10 Augmented simplex-centroid design for a three-component mixture 
experiment 


Pseudocomponents for Ingredients with Lower Bounds 


Many mixtures require all components to be present in at least some minimum pro- 
portions. Clearly, concrete requires some minimum proportions of water, cement, 
and aggregate. Lower bounds, L;, on component proportions impose the constraint 


on the component proportions. Suppose the lower bounds for cement (z,), water 
(x2), and aggregate (x3) are 


0.10 < 2 O20 = £5 0.30 < 23 


and a {3,2} simplex-lattice design is going to be used for the experiment. The 
lower bounds on the component proportions limit the design to a subregion of the 
original factor space on the simplex shown in Figure 13.9 or Table 13.7. 

To simplify the construction of the design coordinates a set of pseudocom- 
ponents are constructed by coding the original component variables to a simplex 
coordinate system for the pseudocomponent variables Z; with constraints 
0 < %, < 1. If the lower bound for component 7 is L; and L = L;,, then the 
pseudocomponent %; is computed as 
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A design in the original components can be constructed on the basis of coordinates 
for the pseudocomponents that are set up in a regular simplex with © %; = 1. The 
proportions of the original components required for mixtures in the experiment can 
be derived by the reverse transformation 


xi=L,+ ¢(1 — L) 


For the concrete example the lower bounds were L; = 0.10, Lz = 0.20, and 
L3 = 0.30 with sum L = 0.10 + 0.20 + 0.30 = 0.60. The pseudocomponents are 


a= 010 Lo — 0.20 3 ae BO 


0.40 oe eT aks 


Y= 


8 
| 


and the transformations back to original component proportions from the pseudo- 
components are 


xy =0.10+0.407, 2x2 =0.20+040% 23 = 0.30+4+ 0.40%; 


The complete design for the concrete mixture experiment is shown in Table 13.8 
with values for the pseudocomponent coordinates and the original components on 
the subregion of the original simplex. 


Table 13.8 Pseudocomponent and original component coordinates in a {3,2} 
simplex-lattice design for the concrete mixture experiment 


° 


Pseudocomponents Original Components 
Ue. ae eee ae Cement Water Aggregate 
ee ee ee oe 

O1G . 0.60 0.30 

C10). 020 0.70 

0.30 0.40 0.30 

0.30 0.20 0.50 

0.10 0.40 0.50 


Le. 
2 
NO 
8 
WwW 


Onlkwole Oo CO — 
nie Onl Oo — © 
NIK ni © —_—- © Oo 


13.7 Analysis of Mixture Experiments 


Canonical Polynomials to Approximate Surfaces 


The general form of the polynomial function used to approximate linear response 
surfaces is 


= Bo + Biri +--+ + Bere (13.16) 


The restriction on mixture components, 2; + 22+ . 0, = 1, creates -2 
dependency among the z; in the linear function. Multiplying { by 
(x; + Zo +---+ 2%) = 1 provides a reexpression of the model as 
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k k 
by = Bo (So ai) + > Bins = Bier + Bm t--+ Bir, 13.17) 
i=1 


i=1 


where 67 = fo + f;, t= 1,2,... , k. The reexpressed equation with parameters (37 
is known as a canonical polynomial. The canonical polynomial and the original 
polynomial are equivalent because one is derived from the other and the degree of 
the polynomial and the number of components are unchanged upon reexpression. 
The quadratic polynomial function used to approximate response surfaces is 


ae 7 k 
by = Po+ =. Bixi + = Bias a pa >. PijXiX; (13.18) 
1=1 2s oY 


i=1 


The quadratic canonical polynomial produced by enacting the restriction iz; = 1 
is 


k 
Haciacehen Hetiste Sc Yo, Bate; (13.19) 
i=1 ae 


where 67 = fo + 6; + 6 and fj, = B;; — Bi; — Bj;. The new parameters of the 
quadratic canonical polynomial for three-mixture components expressed in terms of 
the original polynomial parameters are 


By = Bot fit Pu By = Bo + Bo + Boe B3 = Bo + B3 + Bs 
Bio = Pi2 — Bir — Pre Bi3 = Biz — Pir — B33 ~~ 83 = Bo3 — Boo — ss 


The interpretation of the canonical polynomials is illustrated with a mixture 
experiment on gasoline component blends. 


Example 13.4 A Gasoline-Blending Mixture Experiment 


The octane of a gasoline blend depends upon the proportions of the various 
petroleum components blended to produce the fuel. The objective of most 
gasoline-blending studies is to develop a linear blending model to determine 
the most profitable blend of gasoline components. Coefficients in the linear 
blending model, referred to as blending values, describe the blending be- 
havior of a given fuel component. However, the blending composition de- 
pends on a number of factors including the quality of the components. Thus, 
the linearity of the blending components is lost, and a more complex qua- 
dratic or blending component interaction model must be considered. 

The analysis of a mixture experiment is illustrated with a gasoline- 
blending mixture experiment. To evaluate the need for an interaction blend- 
ing model the experiment was designed to estimate the quadratic canonical 
polynomial. 
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A mixture experiment was set up to evaluate the effect of three com- 
ponents on the octane ratings of gasoline. The components alkylate (A), light 
straight run (B), and reformate (C’), were used in a simplex-centroid design 
with seven mixtures. The octane ratings were determined for two replicate 
runs of each mixture. The octane ratings for each of the mixtures are shown 
in Table 13.9. 


Table 13.9 Octane ratings from a mixture experiment on gasoline blends 


Components* - 

oe #2 T3 Yij Vi. 

I 0 0 106.6, 105.0 105.80 
0 l 0 83.3, 81.4 82.35 
0 0 I 99.4, 91.4 95.40 
: ; 0 94.1, 91.4 O2.75 
, 0 ; 101.9, 98.0 99.95 
0 ; = 92.3, 86.5 89.40 
5 } t 96.3, 91.7 94.00 


“x, = alkylate, x. = light straight run, x3 = reformate 
Source: R. D. Snee (1981), Developing blending models for gasoline and other mixtures. Techno- 
metrics 23, 119-130. 


Estimating the Quadratic Canonical Polynomial Response Surface Model 


For convenience the asterisk will be dropped from the coefficients in the canonical 
polynomial equations. The full quadratic canonical polynomial model for the 
gasoline-blending experiment is 


Yij = P12; + Pox; + 32033 + G1271j;%2; + 81321503; + Bo3%0;03; + €:; 
a ee (13.20) 


_ where the experimental errors e;; are assumed to be independent and normally dis- 
tributed with mean 0 and variance o?. Also, t = 7 mixture treatments and r = 2 
replications per mixture produce a total of N = rt = 14 observations. 

The hypothesis of initial interest is whether the response depends on the mix- 
ture components according to the quadratic model. When the null hypothesis is true 
the mean response is adequately described by the reduced model y;; = {o + e;;, 
where J; = 62 = 83 = fo and By2 = B13 = fo3 = 0. 

An analysis of variance of the data in Table 13.9 for the mixture treatments in 
a completely randomized design will provide an estimate of pure experimental er- 
ror. The sum of squares among the mixtures is 
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t 
oot =t y = oy = 60932 
=I 


with (t — 1) = 6 degrees of freedom. The sum of squares for experimental error is 
t rT 
S8h=) + G97 =n 
i=l j=l 


with (V — t) = 7 degrees of freedom. Thus, the mean square for experimental er- 
ror is MSE = 10.53 with 7 degrees of freedom. 

The least squares estimates of the parameters for the canonical polynomials re- 
quire a fit to a regression model without the usual intercept term {o. Many com- 
puter regression programs have the capability to fit the regression model without 
the intercept term, and they will provide the correct least squares estimates of the 
parameters for the canonical polynomials. 

The estimated full quadratic equation is 


Y = 105.82; + 82.349 + 95.4%3 — S.1ay ro — 2.49173 4+2.3%2%3  =((13.21) 


with experimental error sum of squares SSE; = 73.76 with 14 — 6 = 8 degrees of 
freedom. 


Tests of Hypotheses About the Model 


A Test for the Complete Model 


If the response does not depend on the mixture components, the fully reduced 
model is y;; = So + e;; and the surface has a constant height. The experimental er- 
ror sum of squares for this reduced model is 


t 


SSE, =>, > (yj — 9) = 743.05 
2S 


=1 j= 


with N — 1 = 13 degrees of freedom. 
The sum of squares reduction for the full quadratic response surface model is 


SSR = SSE, — SSE; = 743.05 — 73.76 = 669.29 


with 13 — 8 = 5 degrees of freedom. The sum of squares for the quadratic model 
accounts for 5 of the 6 degrees of freedom for treatments with 1 degree of freedom 
remaining for lack of fit to the quadratic model. The analysis of variance is sum- 
marized in Table 13.10. ) 

The null hypothesis for the quadratic response equation is Ho: 2; = Bo = 
G3 = Bo and (i. = fi3 = Go3 =0. The test statistic Py = MSR/MSE = 
133.86/10.53 = 12.71 exceeds the critical value of F'o5,5,7 = 3.97, and the null hy- 
pothesis is rejected. 
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Table 13.10 Analysis of variance for mixture experiment with gasoline blends 


Source of Degrees of Sum of Mean 
Variation Freedom Squares Square 
Treatments 6 669.32 114355 
Regression : 669.29 133.86 
Lack of fit l 0.03 0.03 
Error 7 73.74 10.53 


A Test for the Quadratic Terms 


The investigator would want to know whether the full quadratic model is nec- 
essary to approximate the response surface or whether the linear surface is adequate 
to explain the relationship between octane rating and the component mixtures. A 
test of the null hypothesis Ho: G12 = (213 = {23 = 0 will determine whether the qua- 
dratic components of the model are necessary. The full model and reduced model 
principle can be used to determine the significance of the quadratic components. 
The sums of squares for experimental error are required from the full quadratic 
model and the reduced linear model y;; = 6121; + 62%; + B3x3; + e:;. The esti- 
mated reduced model linear equation is 


y = 103.19; 482. 1a5 + 95.57 (13.22) 


with experimental error sum of squares SS F);, = 77.37 with 14 — 3 = 11 degrees 
of freedom. The sum of squares reduction due to the quadratic terms after the linear 
terms are fit is 


SS h(quadratic) = SS Ey, — SSE; = 77.37 — 73.76 = 3.61 


with 11 — 8 =3 degrees of freedom with MS R(quadratic) = 3.61/3 = 1.20. The 
test statistic Fo = MS R(quadratic)/MSE = 1.20/10.53 = 0.11 with critical re- 
gion Fo > F'953,7 = 4.35 is not significant, and the quadratic terms do not improve 
the approximation of the model to the mixture response surface. Thus, a linear 
blending model is adequate for this set of blending components. 


A Test for the Linear Terms 


Since the quadratic components of the model account for very little of the re- 
gression sums of squares it is fairly obvious the linear terms account for most of the 
sum of squares for the regression model. A formal test of the null hypothesis Ap: 
GB, = Bo = B3 = Bo is derived from the sum of squares reduction for the linear 
model as : | 


SS A(linear) = SSE, — SS Ejin = 743.05 — 77.37 = 665.68 
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with 13 —11=2 degrees of freedom with MS R(linear) = 665.68/2 = 332.84. 
The test statistic is Fo = MSR(linear)/MSE = 332.84/10.53 = 31.61 with 
critical region fo > F'o5.9.7 = 4.74, and the null hypothesis is rejected. 


Interpretations for the Estimated Response Equation 


The estimated linear canonical polynomial, 7 = 105.12, + 82.1ro + 95.523 in 
Equation (13.22), provides a significant and adequate fit to the mixture response 
surface. The estimated standard error for each of the B, determined from the regres- 
sion program is s_ = 1.89, and the individual coefficient estimates are significant 


by the Student ¢ test, tp = GB, ‘ SB with 7 degrees of freedom. 


The coefficient @, is the estimated response at the vertex of the simplex design 
representing the mixture with 100% of that component or the single-component 
mixture. Alternatively, it represents the estimated response at the maximum value 
for that component. The variables for Example 13.4 were the proportions 
x, = alkylate, x2 = light straight run, and x3 = reformate. For example, with 
100% alkylate and 0% each of light straight run and reformate the estimated octane 
rating is ¥ = 105.1. Likewise, with 100% light straight run the estimated octane is 
Y = 82.1, and with 100% reformate the estimated octane is 7 = 95.5. The estimated 
linear surface is depicted in Figure 13.11. 


A 
y= 105.1 


A 
ras y=95.5 


A 
y= 82.1 


Figure 13.11 Estimated linear surface for the gasoline-blending mixture 
experiment 


The (3;; describe the departures from the linear response surface. An illustration 
of a quadratic response for a two-component system is depicted in Figure 13.12. If 
the two components are additive, with a linear blending of the two components the 
mean response is pj) = (121 + Goxr2 (shown as the straight line blending in Figure 
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(13.12). A nonlinear quadratic blending of the two components with 2 > 0 is 
shown by the response curve fig = (121 + Go%2 + Bi2%\22. The coefficient 3; 
represents the height of the curve when z; = | and x2 = 0, and (> represents the 
height of the curve when x; = 0 and z2 = 1. The (22,22 term contributes to the 
response whenever z; > 0 and x2 > 0. The maximum departure from the linear 


blending occurs at x; = x2 = $ when (12%122 = $12. The blending of the two 
components in Figure 13.12 is said to be synergistic because the response for the 
lL: 1 piintose at ap = oo = 5 exceeds the simple average of the pure mixtures de- 


picted by the linear blending line. If the coefficient G,;. were negative the nonlinear 
blending line would fall below that of the linear blending line and the components 
would be antagonistic to one another. 


f Ha = ByX, + Box + ByoX1 Xo 


By 


Bo 


y= By + Box% — 


2 
2 
p! 1 
2 


Figure 13.12 Comparison of linear and nonlinear blending for a two-component 
mixture system 


EXERCISES FOR CHAPTER 13 


1. A 2° factorial experiment was conducted to determine whether the volume of two reagents had an 
effect on the ability of an assay method to measure levels of a specific drug in serum. Two 
replications of the treatments were tested in a completely randomized design with two additional 


460 CHAPTER 13 RESPONSE SURFACE DESIGNS 


replications at the center of the design. The serum for each test was sampled from a serum pool 
spiked with a single dose of the drug. The data are shown next with reagent volumes in pl. 


Reagent 
AB % Drug Recovered 
10 20 a 
~ 40 20 44,47 
10 50 = Pee 
40 50 68, 72 
25 Ee 48, 53 


Estimate the experimental error variance. 

b. Estimate the linear response equation and standard errors of the coefficient estimates. Are the 
linear effects of the reagents significant? 

c. Estimate (1. for AB interaction and standard error of the estimate. Is there significant 
interaction? | 

d. Estimate the departure from a linear surface, 3); + (22, and standard error of the estimate. Is 
there a significant departure from a linear surface? 

e. Determine the first five steps on the path of steepest ascent from the center of the design with 

steps of one unit in x, for reagent A. Show the levels of both factors at each step. 


= 


2. Consider the experiment on vinylation of methyl glucoside in Example 13.1. Suppose the chemist 
has observed a maximum response at T = 175°C and P = 475.5 psi via the path of steepest ascent 
in Table 13.2. Use those levels as the average factor levels in Equation (13.4), and design a central 
composite rotatable design to estimate the quadratic response surface equation. Show the actual 
levels of T and P required for each treatment combination in the design. 


3. Show the coded design coordinates required for a uniform precision rotatable central composite 
design with four factors. 


4. Describe the response surface for the following canonical forms given for quadratic response 
surfaces: 


5. An animal scientist studied the relationship between metabolism of methionine, a sulfur amino 
acid, and carotene, vitamin A, as they affect the growth of chickens. The optimum levels of 
methionine and carotene were thought to be 0.9% methionine in the diet and 50 micrograms 
carotene per day. A central composite rotatable design was used for the experiment. Eight chicks 
were randomly assigned to each of the treatment diets, and their weight gains were recorded after 
38 days. The average weight gains for the treatments follow. 
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Original Factors Coded Factors 
“Methionine Carotene = &~—S—*=é<“C«~SD”sé‘(‘C;O#Weeightt Gain 

1.183 85.36 | 5 ae +4 — 445 
1.183 14.64 +1 —] 331 
0.617 85.36 —] +1 443 
0.617 14.64 —] —] 336 
1.183 50.00 g2 0 414 
0.500 50.00 — /2 0 389 
0.900 100.00 0 f2 435 
0.900 0.00 0 — /2 ‘eo 
0.900 50.00 0 0 442 
0.900 50.00 0 0 412 
0.900 50.00 0 0 418 
0.900 50.00 0 0 440 
0.900 50.00 0 0 44] 


a. Estimate the quadratic response surface equation for weight gain, and summarize the sum of 
Squares partitions in an analysis of variance table. 

b. Test the significance of the complete quadratic model, the quadratic deviations from the linear 
model, the significance of the linear components of the model, and the lack of fit to the 
quadratic model. What are your conclusions? 

c. The response surface has a maximum within the design coordinates. Determine the levels of 
methionine and carotene that produce the maximum response, and estimate the maximum 
response. 

d. Compute the canonical equation (see Appendix 13A.3), and describe the response surface. 
Based on the canonical equation, what is the relationship between methionine and carotene? 
Can one be used to compensate for the other in the animal’s diet? 


The experiment on vinylation of methyl glucoside used in Example 13.1 (Marvel et al., 1969) in- 
cluded four factors in a central composite rotatable design placed in an incomplete block design. 
The percent conversion of methyl glucoside to a vinylation product was the response variable of 
interest. The actual and coded levels of the four factors used in the experiment were 


Coded Level —2 — | 0 1 Z 
L1 Time, hours l 3 5 7 9 
x2 Temperature, °C MAD 62130 chido 460 2475 
XL3 Pressure, psi 7 2S 325778 800 2 475 9777550 
x Solvent ratio (water/dioxane) 95 80 65 50 ‘85 


The percent conversion of methyl glucoside to a vinylation product for each treatment combination 
in the experiment is shown in the table that follows: 
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ee Set Se | ee | 36 += 1 | | ] is (o.2 0 0 0 

ot ee 2) ope ed 1 43 2 0 0 0 
ce ee | ] ] Be ] l L 2 oe ~2 0 0 
46 ] 1 27 bid] 4 0 2 ) 0 
6 a ] 19 aS. = 4 ee 18 0 ee 0 
16 ee | 2 eee | 52 i te ] 52 0 0 2 0 
Feed | 1 fe 1 gece Sf cee Lael 15 0 0 0. = 
ee cote | | = 24 —-1 bo 1 = 58 0 0 0 2 

0 0 0 0 33 0 0 0 0 34 0 0 0 0 

0 0 0 0 30 0 0 0 0 39 0 0 0 0 


10. 


Block 1 Block 2 Block 3 
Ly L2 3 L4 7] Ty X2 £3 LA Y 


8 
8 
Ne) 
8 
Ww 
8 
1S 


a. Estimate the quadratic response surface equation for percent conversion, and summarize the 
sum of squares partitions in an analysis of variance table. 


b. Test the significance of the complete quadratic model, the quadratic deviations from the linear 


model, the significance of the linear components of the model, and the lack of fit to the qua- 
dratic model. What are your conclusions? 

c. Determine the factor levels that produce the maximum response with the quadratic model. Is 
the optimum within the current design factor levels? 

d. Compute the canonical equation from the quadratic model estimates (see Appendix 13A.3), 
and describe the response surface. Based on the canonical equation, what type of surface has 
been estimated? 


Construct an incomplete block design for a central composite design with three treatment factors 
such that the conditions in Equations (13.5) and (13.6) are satisfied. What is the value of a required 
for the relationship to hold in Equation (13.7)? 


An experiment is planned to evaluate the flavor quality of a fruit juice containing orange, pine- 
apple, lime, and papaya juices. The minimum allowable proportions of the four juices in the mix 
are orange > .15, pineapple > .10, lime > .10, and papaya > .20. Design a mixture experiment 
with a simplex-lattice design. List the design both as pseudocomponents and as original 
components. 


A highway-paving mixture is formed by dispersing liquid sulfur into liquid asphalt to produce a 
sulfur—asphalt binder. The binder is then mixed with sand to produce the paving mixture. A mix- 


_ ture experiment is to be conducted with a minimum of 5% sulfur and a minimum of 10% asphalt. 


a. Design the mixture experiment with a simplex-centroid design. Show the design as pseudo- 
components and as original components. 

b. Construct the design as an augmented simplex-centroid. Show the pseudocomponents and 
original components. 


A mixture experiment was conducted with the sulfur—asphalt binder described in Exercise 13.9. 
The experiment was conducted with a minimum of 10% sulfur, a minimum of 20% asphalt, and a 
minimum of 50% sand in the mixtures. Two replications of each mixture were prepared, and a 
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specimen from each replication was subjected to a strength test. A simplex-lattice design was used 
for the experiment. The strength data with mixture proportions for pseudocomponents, where 2, is 
the pseudocomponent for sulfur, x2 for asphalt, and x3 for sand, follow. 


Pseudocomponents Strength 
Ly LQ 3 Vij 
l 0 0 i240 3.7 
0 1 0 2.4, 3.6 
0 0 l 20,49 
Be ee | oe ae 2 18.9, 16.8 
oo 0.5 19.4, 17.1 
0 os. 03 4.6, 7.3 


a. Given the minimum values for sulfur, asphalt, and sand, determine the actual proportions for 
the three components for each of the treatment mixtures in the experiment. 
. Estimate the experimental error variance. 
c. Estimate the linear and quadratic response surface polynomial. Determine the significance of 
the linear terms and the significance of the quadratic addition to the model. 
d. Interpret the coefficients in the model. 


13A.1 Appendix: Least Squares Estimation of Regression 
Models 


The least squares estimation of parameters for the regression model follows the 
procedures illustrated in previous chapters for various experiment designs. The dif- 
ference to be noted for the regression model is the inclusion of the continuous value 
x; variables in the model that were not seen previously in the experiment design 
models. 

Estimation is illustrated for a model with two x; variables. The multiple linear 
regression model is 


Dye Bree; ea + Goak, + €; | t= 1, 2,...,n 


The intercept or constant term for the model written in the general form as {o20; is 
most often identified in the model only as {4 since the variable xo; takes on a con- 
stant value xo; = | for all observations. 

The least squares estimators for the regression coefficients are found by 
differentiating 


n 


~ = = (yj; — Botoj; — Pit1; — Box0;)° 
aa 


j=1 


with respect to each of the (; and setting the result equal to 0. The partial deriva- 
tives set to 0 are 
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‘e) nr 
ae c 2 ~ Lo;(¥j — Po®o; — Pixij — P2%2;) = 0 
0 pe 
fa) n 
cee 2 >. £1;(Yj — Po®o; — Pili; — Prr2;) = 0 
UP-1 j=l 
O nr 
Bo,csi 2S © 224(yj — Boro; — Art1y — B2r2j) = 0 
2 j=l 


The resulting normal equations are 


Bo > x, + By > L0jL1j + Bo > L0jL25 = > LOjY; 
a. > L0jL15 + By > Xi = is Bo > XL15025 = > L15Y; 


Bo iG L0jL25 + By x Ly jL25 + Bo DD 25, a L25Yj 


The simultaneous solutions of equations for the G, result in the least squares 
estimators for the G;. The estimates for a given problem can be obtained with any 
statistical computing package that includes a program for multiple linear regression. 

The interested reader can find a detailed account of regression analysis 
methodology in Rawlings (1988). A brief outline of the model formulation, con- 
struction of the normal equations, and solutions to the normal equations are given 
here assuming some knowledge of matrix notation. The multiple linear regression 
model can be written in matrix form as 


y=XBP+e 
where 
Yi La el Bo e1 
yate lot gd Heteresteet gh} ole 
Un L fin Bhs 6 en 


Denoting .X’ as the transpose of the X matrix the normal equations in matrix form 
are 


X'XB=xX'y 


where 
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n Ly ee ees Se Ly 
iL] iat LX Shae 21 LE xy 
X'X = xr 21 Xo Lae a LoL, X'y = LiLoy 
Po, Safle, Se? Dany 


The solutions to the normal equations for @ are found by multiplying both sides of 
the normal equations by the inverse matrix for X’X denoted (X’X)~!. The solu- 
tion is written as 


B=(X'Xy'X'y 


The computations are illustrated with data for two independent variables, x, 
and 2, to estimate the coefficients for the first-order model y= 
Bo + 1x1 + Box2 + e. The data matrices are 


ge ee 6a 67, 79.) 


es Oe a wis eons aap 
MIE Taleo 3.425 6 7 

ms 33 2-3 3 

ieee ee 432 
X’X=|28 140 63| X’y= | 1,884 

ma 3 8 921 


16 003. — 06.44 
ky) =1 60). 606 —0.11 
og AA OT 5 ga Ad 


The solution to the normal equations is 


116 —003 -—0.44] [ 432 31.43 
B=(X'X) 1 X'y= | —0.03 0.06 —0.11] | 1,884] = | 3.57 
—0.44 —0.11 0.44 921 8.00 


The estimated equation is 9g; = Bo Xo + 6121; eos & BK, and the sum of squares 
for experimental error for the full regression model is calculated as 


SSE; =) (yj - 3; = Y- XB) Y- XB) =y'y- BX 


ts 

s a -_~ “_ 
=) oy} —B.Dy; — B,Darjy; —- — B, Dany; 
rs 


with n — k — 1 degrees of freedom. 
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The reduced model without the variables 2,;,1=1,2,...,k, 1s 
y; = Bor0; +e; =ut+e,;. The estimated regression equation with the reduced 


model is ¥; = BoXo- The sum of squares for experimental error for the reduced 
model is | 


SSE, =~ (y3- 3) = 093? — Bo doy; 
j=l gk j=l 


with n — | degrees of freedom. 


For the reduced model Bo =y7 al Soe, => y )*. The sum of squares 
j=l 


for regression due to the inclusion of the independent variables z;,2 = 1,2,...,k, 
in the model is 


SSR =SSE,- SSE; 


with k degrees of freedom. 
Given » y5 = 27,716, the experimental error sum of squares for the full model 
of the example is 


SSE; = 27,716 — 31.43(432) — 3.57(1,884) — 8.0(921) = 44.4 


with n—k—1=7—2-—1=4 degrees of freedom. The estimate of (0 for the 
reduced model is 7 = 61.7, and the experimental error sum of squares is 
SSE, = 1055.4. The sum of squares for regression is 


SSR = 1055.4 — 44.4 = 1011.0 


with k = 2 degrees of freedom. 


13A.2 Appendix: Location of Coordinates for the Stationary 
Point 
The estimated quadratic model expressed in matrix form is 


9=6,+2'b4+2'Ba (13A.1) 


where 


B, | 
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Bu By» /2 By3/2 sa By,/2 
B12/2 Pa By3/2 a Box /2 
B= By3/2 Bo3/2 P33 = Bx,/2 


ByfaB 2 Baya iw bo Bec 


The stationary point is found by setting the derivative of 7 with respect to the x 
vector equal to 0, 


jp 
— =b+2Bzr =0 (13A.2) 
Ox 
The vector of design coordinates for the stationary point is the solution to Equation 
(13A.2), or 
] 
Co. = — — = (13A.3) 


Substituting the solution into Equation (13A.1), the estimated response at the sta- 
tionary point is 


a 4 
Ys = Bot 52.6 (13A.4) 


13A.3 Appendix: Canonical Form of the Quadratic Equation 


The canonical form of the quadratic response equation is 
G=9,+AZi + ros +--+ AZ? (13A.5) 


where the A; are the eigenvalues of the matrix B in Equation (13A.1) and the Z; 
are variables associated with the rotated axes that correspond to the axes of the con- 
tours of the response surface. The origin for the rotated coordinate system is the 
stationary point with all Z; = 0 and response ¥,. 

The eigenvalues of B are the roots of the determinantal equation 


|\B-—dF| =0 (13A.6) 
where J is the identity matrix. The relationship between the matrix B and the ), is 
Bm; = m;; i= 1.2. seey ke (13A.7) 


where the m, are the eigenvectors corresponding to the \;. The mm; are normalized 
so that mm; = 1. 

The relationship between the variables representing the coded factor levels x 
and the canonical equation variables Z is 
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Z = M'(xz—z,) (13A.8) 


where the columns of M are the normalized eigenvectors 77. 

The various methods required for the calculations can be found in standard ma- 
trix algebra books such as Graybill (1983). The matrix calculations can be per- 
formed with many of the common computer programs. The calculations are illus- 
trated with the response equation for Example 13.3, 


@ = 169 + 6.74721 + 26.3852 — 10.875? — 21.62522 — 15.25a 22 
The determinantal equation is 


— 10.875 — A = 1.029 
= 7:625 — 21.625 — Xr 


with \? + 32.5\ + 177.03 = 0. The roots of the quadratic equation are 4; = 
— 25.58 and \» = — 6.92. Thus, the canonical equation with ¥, = 177.25 is 


9 = 177.25 — 25.582? — 6.9225 


The matrix of normalized eigenvectors is 


[0.4603 0.8877 
~ [0.8877 — 0.4603 


The coordinate of the stationary point is x71, = —0.156 and x9, = 0.665, and the 
relationship between the canonical variables and the coded factor variables is 


Z| | 0.4603 0.8877 | | (7; + 0.156) 
Z|  |0.8877 —0.4603 | | (x2 — 0.665) 


or 


Z, = 0.4603(a1 + 0.156) + 0.8877(a2 — 0.665) 


Z> = 0.8877(a + 0.156) — 0.4603(x2 — 0.665) 


14 oplit-Plot Designs 


This chapter introduces the split-plot design for experiments with a factorial treat- 
ment design and describes some unique features of the design relative to its struc- 
ture, composition of experimental errors, and analysis. The relative efficiency for 
split-plot designs is also discussed. Extensions and variations of the design include 
the split-split-plot and split-block designs. 


14.1 Plots of Different Size in the Same Experiment 


One factor sometimes requires more experimental material for its evaluation than a 
second factor in factorial experiments. In agronomic or horticultural field trials a 
factor such as cultural methods may require the use of equipment that is best-suited 
for large plots, whereas another factor in the experiment such as cultivar or fertility 
level may be applied easily to a much smaller po of land. The larger cultural treat- 
ment plot, the whole plot, is. split into smaller subplots to which the different culti- 
vars or fertility treatments are applied. ° This is s known as a split-plot design, and in 
this particular example there are two different sizes of experimental units. 

The experiment used for the following example illustrates the creation of a 
split-plot design when a second factor was introduced to subdivisions of the exist- 
ing experimental units for an experiment already in progress. 


Example 14.1 Nitrogen Fertilizer and Thatch Accumulation 
in Penncross Creeping Bent Grass 


The soil for most golf greens is almost pure sand and frequent irrigation and 
fertilization are required to maintain the turf. The sandy soil has little capacity 
to retain nitrogen, and after fertilization the nitrogen quickly leaches from the 
root zone after irrigation. Administering large initial doses of nitrogen to 
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retain nitrogen for a longer period of time is harmful to the turfgrass and soil 
microbes. Thus, a means to apply moderate rates of nitrogen and to retain 
nitrogen in the root zone would be very beneficial. 

Nitrogen fertilizers are manufactured in various chemical configurations. 
Two commonly used fertilizers, ammonium sulphate and urea, were known 
to be fast-release nitrogen forms and were expected to leach out of the soil 
very quickly. Others such as isobutylidene diurea (IBDU) and sulphur-coated 
urea, urea(SC), although more costly, were known to be slow-release nitro- 
gen forms and expected to stay in the soil for a longer period of time. 

A second factor that could affect nitrogen retention was the turf thatch, or 
the buildup of dead grass. The thatch was removed frequently because it was 
thought plant diseases accumulated in the buildup. However, thatch provides 
a capacity to retain nitrogen fertilizer and could partially relieve some of the 
difficulty of nitrogen loss with the sandy soil. 


Research Objective. A soil scientist wanted to investigate the effects of 
nitrogen supplied in different chemical forms and later evaluate those effects 
combined with those of thatch accumulation on the quality of an established 
turf. : 


T reatment ei a four forms of Sisidindo —— _ = the sue 
were (1) u >) ammonium - 1 | di (IBDU). and 


to athe turf at a i fale er: Gui of nitrogen per 1000 square feet of turf. Any 
differences in responses to the fertilizers then could be attributed to the mode 
of nitrogen release because an equivalent amount of nitrogen was supplied by 
each form. 


Experiment Design: A golf green had been constructed and seeded with 


| med. The lengths of time the thatch 3 was Palcwed to aaekts on 
the Giiplor green were two, five, or eight years. 

The resulting split-plot design had the whole-plot treatment factor of nitro- 
gen source in a randomized complete block design with years of thatch accu- 
mulation as a subplot treatment factor. 

The field plan for one replication of the split-plot design for the turfgrass 
experiment after randomization is shown in Display 14.1. The nitrogen 
whole-plot treatment, factor A, has four levels, and the years of thatch accu- 
mulation subplot factor, factor B, has three levels. 
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Display 14.1 Split-Plot Experimental Layout 
for the Turfgrass Experiment 


Subplot 
Factor B 


per, 
Whole Plot 
Factor A 


Source: Dr. 1. Pepper, Department of Soil and Water Science, University of Arizona. 


An Altered Randomization for Split-Plot Designs 


The usual randomization of the factorial treatment combinations to the experimen- 
tal units has been altered to accommodate the particular requirements of the experi- 
ment. For example, the split-plot design in Display 14.1 has one additional 
restriction in the randomization relative to that of the usual randomized design. In 
= sot UEP Tat ree fenced Sie is Be ps 


More Possibilities for Two Unit Sizes 


If one factor, such as temperature, humidity, or photoperiod, requires environmen- 
tal control chambers, then each level of that factor requires a separate chamber. A 
second treatment factor such as culture media for plants and even a third treatment 
factor can be included within each of the chambers, thereby achieving an economi- 
cal use of the chambers to study more than one factor at a time. The chambers 
represent whole plots, and the subplots are units to which the second factor lev- 
els are applied inside each chamber. 
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In education research a teaching method is applied as a whole-plot treatment to 
an entire classroom; however, subgroups of students within each classroom can be 
used as subplots to study an additional factor, such as the use of certain library 
materials or microcomputers. 

Industrial research may require replicate batches of raw material mixtures. The 
whole-plot treatment, such as product mixtures, can be one batch of raw material, 
and a subplot treatment, such as curing time, can be applied to subbatches of the 
larger product mixture batches. 


14.2 Two Experimental Errors for Two Plot Sizes 


The statistical analysis must take into account the — of two ene hak ts or 
sizes of Sa a tion units in the - Scllidaig te Fac IT 


| om the subplots for factor B. Since the wile plots and FP ratplats are 
experimental units of eee sizes or apes. they have different precision. The dif- 
fering precision must be taken into account for making comparisons among treat- 
ment means. 


¢ ated. he correlation retiecls the nature of oral units to rend simi- 
laely when they are adjacent to one another, such as neighboring field plots, stu- 
dents in a classroom, cultures in a growth chamber, or units from the same batch of 
raw materials in the industrial experiment. 

We assume a correlation p between observations on any two subplots in the 
same whole plot and also assume observations from two different whole plots are 
uncorrelated. Given these assumptions, it can then be shown that the error variance 
for the main effects of A on a per-subplot basis is o?[1 +(b — 1)p] if there are b 
subplots in each whole plot. Likewise, for comparisons among the main effects of 
B and AB interaction comparisons the error variance per subplot is o7(1 — p). 

As a sessed of these differences 1 in the errors associated with the whole 
: Sed mala um of squares in the 
1 somewhat fre on for a two-factor 


same as athe or the — — designs, but the expermnenia) error is partitioned 
into two = nponent of € experimente tal error is associ 

=HDlo yt trea atment factor and the « yther cor mHnoneT 7c associa’ ad with i i Bntot 
frbanenent pone and interaction. 
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14.3 The Analysis for Split-Plot Designs 


The Split-Plot Model 


A mixed-model formulation is used for the split-plot design to reflect the separate 
experimental error variances for the subplot and whole-plot units. It includes sepa- 
rate random error effects for the whole-plot units and the subplot units. If the 
whole-plot treatment factor is placed in a randomized complete block design the 
linear model is 


Yijk = +a; + pe t+ dix +B eet (a8);; + Ci5k (14.1) 
P= 1, 255..58 7=1,2,...,6 ae Oe a 


where yu is the general mean, q; is the effect of the ith level of factor A, p, is the ef- 
fect of the kth sete diz is the whole-plot random error, ; is the effect of the jth ; 
level yf factor B, (af);; is the interaction effect between factors A and B, and e;;, 
is the subplot random ¢ error. 

The whole-plot and subplot errors are assumed to be sre. caper, normally 
distributed random errors BATE mean a ae variances ae and o?, ae Ran- 
domization | of treatments to the exp erimental units justifies tt assumption of ” inde- 
pendence for the aan ae om braid the onl coment between the errors for 
subplot units on the same whole-plot unit. 


The Split-Plot Analysis of Variance 


The expectation of the mean squares for the analysis of variance using the mixed 
model in Equation (14.1) is shown in Table 14.1 with fixed effects for factors A 
and B. 


Table 14.1 Expected mean squares for the split-plot analysis of variance . 


Source of Degrees of Mean Expected 
Variation Freedom Square Mean Square 
Blocks r—1| MS Blocks 

A a—1 MSA o? + bo? + rb0? 
Error(1) (a — 1)(r—1) MSE(1) o? + bo? 

B b—] MSB o? + rab? 

AB (a — 1)(b— 1) MS(AB) o2 + 162, 
Error(2) a(r — 1)(b— 1) MSE(2) o? 


The expected mean squares for Error(1) and Error(2) based on the mixed 
model reflect the differences in Si variability for the two different types of 
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experimental units. The expected error variances for the whole plots are larger than 
those for the subplots. 


Tests of Hypotheses About Factor Effects 


The F statistics to test the null hypotheses for interaction and main effects are 


e (interaction) Ho: (a8);; = 0 versus H,: (af); # 0 for some z, 7 


MS(AB) . 
Pgs “MSEQ) with (a = 1)(b aa 1) and a(r pra 1)(b = 1) d.f. 
e (factor B main effects) Ho: 7, =--: =f, versus H,: 2; # 7; for some 
t, J 
MSB =. 
iy = MSE(Q) with b — 1 and a(r = 1)(b aes 1) at 
e (factor A main effects) Ho: fi; =--: = f,, versus H,: f; #7; for some 
459 
MSA _., 
Fo = MSE(\) with a — 1 and (a ge 1\(r - 1) d.f. 


Example 14.2 Data and Analysis 


One of the measurements made on the turfgrass plots of Example 14.1 was 
the chlorophyll content of clippings (mg/g) sampled from each plot. The data 
are shown in Table 14.2, and the analysis of variance is shown in Table 14.3. 


ee 


Table 14.2 Chlorophyll content (mg/gm) of grass clippings 


Years of Thatch Accumulation 


Nitrogen Source Block i 5 8 
Urea 1 3.8 53 59  %& 
2 3.9 5'4 45 452 
Ammonium sulphate l 5.2 5.6 54 6.9 
2 6.0 6.1 62 4.4 
IBDU 6.0 5.6 78 6460 
2 7.0 6.4 738 SpE 
Urea (SC) 6.8 8.6 g.5 7, be 


2 7.9 8.6 84 &.73 


ee 
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Table 14.3. Analysis of variance of chlorophyll content of Penncross creeping 
bent grass clippings 


Source of Degrees of Sum of Mean 


Variation Freedom Squares Square F PreS fF 
Total is 48.78 
Block ] 0.51 0.51 
Nitrogen (J) 3 a7 .32 12.44 29.62 0.010 
Error (1) 3 1.26 0.42 
Thatch (7) - 2 3.82 1.91 9.10 0.009 
Nxt 6 4.15 0.69 3.29 0.065 
Error (2) 8 Liz 0.21 

Computational Notes 


The analysis of variance for the split-plot design can be computed with many 
available statistical es packages. The instructions for the analysis are 
alent to those for a two-factor factorial treatment design except for the neces- 

sity 6 ae oe aici error terms for the analysis. 

The sum of squares for Error(1) is numerically equivalent to the error sum of 
squares for the experiment design that was utilized for the whole plots. With com- 
pletely randomized designs it is equivalent to the sum of squares for whole plots 
within factor A treatments. For randomized complete block designs it is equivalent 
to the computation of the sum of squares for Blocks x A interaction. The sum of 
squares for Error(2) is ordinarily the residual sum of squares computed auto- 
matically by the program. 

Alternatively, the analysis can be conducted with statistical programs specifi- 
cally written for mixed models, which have been incorporated into many statistical 
computing packages. They can be particularly useful for split-plot designs in 
blocked designs with random blocks. The statistical estimation in these programs is 
based on maximum likelihood methods, which are beyond the scope of this book. 

The test for interaction between nitrogen and thatch is Fy = 0.69 70.21 = 3.23, 
and the test is not significant with Pr > F' = .065 (Table 14.3). The test for differ- 
ences among the thatch means is Fo = 1.91/0.21 = 9.10, and it is significant with 
Pr > F' = .009. The test for differences among the whole-plot nitrogen treatment 
means is Fo = 12.44/0.42 = 29.62, and it is significant with Pr > F = .01. 

The cell means and the marginal means for source of nitrogen and years of 
thatch accumulation are shown in Table 14.4. Turf that received the sulphur-coated 
urea, urea(SC), had the highest chlorophyll content followed by IBDU and ammo- 
nium sulphate while urea resulted in the lowest amount of chlorophyll. This relative 
ranking of the chlorophyll content by source of nitrogen was the same for each year 
of thatch accumulation. 3 

The graph in Figure 14.1 shows a plot of the mean chlorophyll content versus 
years of thatch accumulation separately for each of the nitrogen sources. The 
marginal means in Table 14.4 indicate an increase in chlorophyll content as the 
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Table 14.4 Mean chlorophyll content (mg/g) of Penncross creeping bent grass 


Years of Thatch Accumulation 


5 


8 


Nitrogen Means 


er ne EEUU SESE Say" 


clippings 
Nitrogen Source Z 
Urea 3.85 
Ammonium sulphate 5.60 
IBDU 6.50 
Urea(SC) 1235 
Thatch means 5.83 


Bee 
5.85 
6.00 
8.60 
6.45 
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7.80 
8.45 
6.79 
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@ Ammonium sulfate 
# Urea 


Years of thatch accumulation 


4.77 
3.75 
67] 
8.13 
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Figure 14.1. Mean chlorophyll content of Penncross creeping bent grass clippings 
versus nitrogen source for two, five, and eight years of thatch accumulation 


years of thatch accumulation increase. However, observation of the cell means and 
their plotted values in Figure 14.1 shows that this general trend was not manifested 
for each of the separate sources of nitrogen. 

This difference in the trends among the nitrogen sources indicates the possibili- 


e form of interaction may be present even though the global test for 


interaction was not significant. Partitions of the interaction sum of squares will 
sometimes reveal the presence of certain forms of interaction between the factors. 
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Interpretation with Regression Contrasts 


The factorial treatment design is composed of one quantitative factor, years of 
thatch accumulation (T), and one qualitative factor, source of nitrogen (VV). The 
treatment design lends itself to the use of the polynomial regression partitions in the 
analysis of variance for one quantitative factor with one qualitative factor 
(discussed in Chapter 6). Linear and quadratic regression sums of squares partitions 
can be computed for years of thatch accumulation (7) with corresponding parti- 
tions for the nitrogen x thatch interaction. These partitions are shown in the 
analysis of variance for the experiment in Table 14.5. 


Table 14.5 Analysis of variance for chlorophyll content of Penncross creeping 
bent grass clippings with orthogonal polynomial regression partitions for thatch 
factor 


Source of Degrees of | Sum of Mean 
Variation Freedom Squares Square FP 'ga2 3 
Total 23 48.78 
Blocks l 0.51 0.51 
Nitrogen (N) 3 37.32 12.44 29.62 0.010 
Error(1) 3 1.26 0.42 
Thatch (7) 2 3.82 1.91 9.10 0.009 
T linear l a4 Les 17.67 0.003 
T quadratic | 0:11 0.11 0.52 0.494 
Nor 6 4.15 0.69 3.29 0.065 
N x T linear 3 0.80 0.27 1.29 0.358 
N x T quadratic 3 3.36 - 3.33 0.028 
Error(2) 8 L2 0.21 


The interaction for quadratic deviations of nitrogen x thatch is significant with 
Fo = 1.12/0.21 = 5.33 and Pr > F = .028. The linear regression partition for 
thatch (7°) is significant with Fo = 3.71/0.21 = 17.67 and Pr > F = .003. Thus, 
there are significant quadratic deviations from the linear response to years of thatch 
accumulation that differ among the sources of nitrogen. The differing patterns were 
observed in Figure 14.1. 

Several types of comparisons may be of interest at this point to further clarify 
the interpretation. Given there was a significant regression interaction component, 
it will be beneficial to make a comparison among the nitrogen means for each of 
the years of thatch accumulation to determine ‘if the sulphur-coated urea always re- 
sulted in the highest chlorophyll content of the turf. Also a quadratic regression of 
chlorophyll content on years of thatch accumulation can be computed for each of 
the nitrogen sources to characterize the differences among the nitrogen sources 
relative to years of thatch accumulation. 
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14.4 Standard Errors for Treatment Factor Means 


The standard errors shown i in Table 14.6 are used for tests of hypotheses for com- 
parisons among the estimated treatment means. The degrees of freedom associated 
with each of the standard errors are those for the mean square used in the standard 
error. The only exception is the final comparison shown in the table where the stan- 
dard error is a weighted combination of the two mean squares for error. Conse- 
quently, the appropriate degrees of freedom can be approximated by the procedure 
introduced by Satterthwaite (1946) (Chapter 5). The approximation is 


((b — 1)MSEQ) + MSE(P? 
(6-1) MSEQ))? [MSE())/? 
A ea ee aan ae Bey eee, 

fo fi 


where f; and fo are the degrees for MSE(1) and MS E(2), respectively, from the 
analysis of variance. 


df, = (14.2) 


Table 14.6 Standard error estimators for the split-plot design (A = whole-plot 
factor, B = subplot factor) 


Treatment Comparison Standard Error Estimator 
Difference between two A means 


= : /2MSE(\) 
Yu. ~ Be.. oe 
} rb 


Difference between two B means 
2MSE(2) 


Yu. — Yr. | ae 


Difference between two B means 
at the same level of A 


Je ans 2M SE(2) 
Yiu. oe, Y ju. W ea oa 


Difference between two A means 
at the same or different level of B 


b A ef eee 26 — NMSEQ)+ MSE(Q) 
Yuk. — Yok. OF Yuk. set Yom. a 


It should be noted that comparisons requiring standard errors based on the line- 
ar combination of mean squares, (b — 1)MSE(2)+ MSE(\), only have approxi- 
mate probability levels for tests of hypotheses and confidence intervals. The linear 
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combination of mean squares no longer shares the same probability distribution 
properties of the individual mean squares. 


Standard Errors for the Turfgrass Experiment 


Referring to Table 14.6 the standard errors required for the analysis of Example 
14.1 are differences between 


‘ene é » é oO 
. a NDS 
e two nitrogen means ~~ {yo / 


IM SEC 2(0.42 
go" ee) = 0) = 0.37 with 3 degrees of freedom 
} At —_ 6 
$l V'? 2. ¥ of Je vege, af og ws 
e two thatch means! a 


[2MSE(2 /2(0.21 
i ek = a = 0.23 with 8 degrees of freedom 
Ta 


e two thatch means for the same nitrogen 


[2M /2(0.21 
a = — = 0.46 with 8 degrees of freedom 
r 


e two nitrogen means at the same or different thatch levels 


2[(6 — DMSE() + MSE())] 2[2(0.21) + 0.42] 
So yf = 83 


where the degrees of freedom for the last standard error obtained from the 


Satterthwaite approximation. are 
[2(0.21) + 0.42]? 
[2(0.21)|? [0.42]? 
8 . 3 


df= = 8.73 or 9 (14.3) 


The experimental error variance for subplots, 14S E(2) = 0.21, is one-half the 
magnitude of ene Ss error variance estimated for the whole plots, 
M a = 0.42. Bie comparisons among years of thatch accumulation 
neans and ‘interaction 1 meals will be more meee ten comparisons : among source 
of seen means. The question of whether there has been an equitable trade-off 
between the advantages and disadvantages of splitting the plots will be partially 
answered by the relative efficiency computations in Section 14.6. 
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14.5 Features of the Split-Plot Design 


mization for the split-plot design requires that levels of factor A be random- 
ized to the WHO units in accordance with the protocol for the experiment design in 
which the whole units are arranged—that is, completely randomized, randomized 
complete block, and so forth. Levels of factor B are assigned to the subunits within 
each of the whole units at random, separately for each of the whole units. 

The design also can be described as a confounded factorial design (discussed in 
Chapter 11). The subunits are regarded as the experimental units, so that the levels 
of factor A are applied to groups or blocks of the subunits. Consequently, compari- 
sons among the levels of A are confounded with the blocks of subunits. The split- 
plot design is often referred to as a confounded factorial design in which the main 
effects are confounded with blocks, whereas confounding was restricted to interac- 
tions for the designs in Chapter 11. 


The Analysis of Variance for Common Experiment Designs 


The sources of variation and degrees of freedom for the sum of squares partitions 
are shown in Table 14.7 for the split-plot design in which the whole plots are ar- 
ranged in three of the most common experiment designs—the completely random- 
ized, randomized complete block, and Latin square designs. Each design has r 
replications, a levels of A, and 6 levels of B. 


Table 14.7. Sources of variation and degrees of freedom for the split-plot design 
analysis of variance 


Completely 
Randomized Con 3 
Whole Pie | 
Source af. Source af Source af 
Rows a— | 
Blocks r—1- Columns a—1 
A a—1 A a— | A a—| 
Error(1) a(r — 1) Error(1) (r—1)(a—1) Error(1) (a—1)(a—2) 
Subplots 
B b—1 B b—1 B b-—1 
AB (a — 1)(b— 1) AB (a — 1)(b — 1) AB (a — 1)(b— 1) 


Error(2) a(r—1)(b— 1) Error(2) a(r—1)(6— 1) Error(2) a(a—1)(b— 1) 


Different Precision for Whole-Plot and Subplot Factor Means 


The larger degrees of freedom associated with the estimates of experimental error 
for the factor B means and the AB interaction means and comparisons among them 
indicate that the B and AB effects are more precisely estimated than the A 
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effects measured on the whole plots. However, this can be misleading as both 
MSE (1) and MS E(2) for the split-plot design have fewer degrees of freedom than 
does the mean square for experimental error in the design that would occur without 
the split-plot feature for the two-factorial effects. For example, the same factorial 
design in a randomized complete block design without the restrictions of the split- 
plot design would have (r — 1)(ab — 1) degrees of freedom for experimental error, 
which exceeds that for either MSE(1) or MSE(2) with (r—1)(a—1) and 
a(r — 1)(b — 1) degrees of freedom, respectively, for the split-plot design. | 

Practical experience with split-plot designs has shown that MS'E(2) often is 
smaller than M SE(1) as. anticipated from the expected mean squares. Consequent- 
ly, there can be an ae the precision of estimates of B and AB effects that 
offsets the loss in degrees of freedom. However, the average experimental error 
over all treatment effects is ine same with or without the split-plot feature. Thus, 
there is no net gain from the split-plot design. Rather, if there is a gain in the preci- 
sion on B and AB effects it is offset by a loss in the precision on A effects. Conse- 
quently, large and interesting A effects possibly may be judged nonsignificant. 
Evaluation of the relative efficiency of the split-plot design in this context is dis- 
cussed in Section 14.6. 


Advantages of the Split-Plot Experiment 


The primary advantages of the split-plot have already been mentioned: when one 
factor requires considerably more experimental material than another factor, such 
as in the agronomic field studies, or when there is an opportunity to study responses 
to a second factor while efficiently utilizing resources, such as in the growth cham- 
ber studies. The experiment in Example 14.1 illustrated the creation of a split-plot 
design by introducing a second factor to subdivisions of the existing experimental 
units of an experiment already in progress. 


Disadvantages of the Split-Plot Experiment 


The primary disadvantages of the split-plot design include the potential loss in pre- 
cision in treatment comparisons and an increase in complexity of the statistical 
analysis. The analysis of variance and the estimation of the standard errors for dif- 
ferent types of treatment comparisons require additional computations. 


14.6 Relative Efficiency of Subplot and Whole-Plot 
Comparisons 


Ordinarily, the relative efficiency of an experiment design refers to the efficiency 
as a result of blocking by some factor relative to that ignoring the blocking factor. 
With split-plot designs it is informative to consider the relative efficiency of using 
the split-plot design in lieu of the same experiment design without the split-plot 
feature. For example, when the whole-plot treatment factor is arranged in a 
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randomized complete block design it is possible to determine which of the designs 
is more efficient for the whole-plot treatment comparisons and which is more effi- 
cient for the subplot treatments and interaction comparisons. As discussed in Sec- 
tion 14.5 there is a compromise between the increase in precision on the subplot 
treatment means and the decrease in precision on the whole-plot treatment means. 


Efficiency of Subplot Comparisons 


Federer (1955) shows the efficiency of the split-plot design relative to the random- 
ized complete block design for the subplot comparisons to be 
Ls ee al) oa = ca 5: Hig) 


(14.4) 


where 


_ (At Di +3) 
(fi +32 +1) 


fi = a(b — 1)(r — 1), the degrees of freedom for the subplot error MSE(2), and 
fo = (ab — 1)(r — 1), the degrees of freedom for the randomized complete block 
experimental error. 


Efficiency of Whole-Plot Comparisons 


The relative efficiency of the split-plot design to the randomized complete block 
design for the whole-plot comparisons is 


ab ese) + a = =D MSE) 


(14.5) 


with f; = (a — 1)(r — 1) for the whole-plot error MS E(1). 

The efficiency of the split-plot design for thatch and nitrogen x thatch subplot 
comparisons relative to that if all 12 different combinations had been randomly 
allocated to the 12 plots in each block with a=4, b=3, r=2, f; = 8, and 
fo == IPS 


— (9)114) _ 8(0.21) + 3(0.42) 
=) 11(0.21) 


= 0.95(1.27) = E21 


or a gain of 21%. For nitrogen whole-plot comparisons with f; = 3 and f> = 11 
the relative efficiency is 


OU 8(0.21) + 3(0.42) 


Tenia) © = gaye ar Re) = 8 


or only 49%. Therefore, a gain of only 21% in efficiency was realized for the sub- 
plot comparisons, and a loss of 51% was realized for the whole-plot comparisons 
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with the split-plot design relative to the standard randomized complete block 
design. 


14.7 The Split-Split-Plot Design for Three Treatment Factors 


The convenience of introducing a third factor into the treatment design requires a 
subdivision of the subplots such that all levels of the third factor are administered to 
these new subdivisions, referred to as sub-subplots. The design, commonly referred 
to as the split-split-plot, has three different sizes or types of experimental units. 
The analysis requires the computation of an additional sum of squares for experi- 
mental error associated with the sub-subplots. The partition of the sum of squares 
for factors A and B is that shown in Table 14.7. If the third factor, C’, has c levels 
the additional sources of variation and associated degrees of freedom in the analysis 
of variance are 


_ Source Degrees of Freedom 
C c— 1 
AC (a—ite—1) 
BC (b— 1)(c — 1) 
ABC (a — 1)(b— 1)(c — 1) 
Error(3) ab(r — 1)(c — 1) 


The sum of squares for Error(3) is the usual residual sum of squares produced 
by a computer program. Special instructions must be given for separate computa- 
tion of the sum of squares for Error(2). For randomized complete block designs the 
sum of squares for Error(2) is computed as the pooled sum of squares for the 
Blocks x B and Blocks x AB interaction. 

All of the standard errors applied to A and B effects shown in Table 14.6 are 
valid and need only have the value of c included as part of the divisor. For exam- 
ple, the standard error for the difference between two factor A means Y,,, — Yy,,, 1S 
,/2MSE(1)/rbc. The standard errors required for comparisons involving the fac- 
tor C effects are shown in Table 14.8. 


14.8 TheS ck Design 
Numerous variations of the basic split-plot design have been used by investigators 
beyond further subdivisions of the experimental units. One common variation is the 
split-block design in which the subunit treatments occur in a strip across the whole- 
plot units. The design is sometimes called a strip plot design. 

The split-block design can be useful in agricultural field studies when two 
treatment factors require the use of large field plots. The levels of one treatment 
factor A are randomly assigned to the plots in a randomized complete block design. 
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Table 14.8 Standard error estimators for the split-split-plot design 


Treatment Comparison Standard Error Estimator 


= zs /2MSE(3) 

Two C' means (¥_, — Y.,) 7 

ra 
Two C’ means at the 2M SE) 
same level of A (Y;,, — ¥;.) ee ee 
Two C means at the | 2M SE(3) 
same level of B (Y;,, — Yin.) re 
Two C’ means at the 2M SE(3) 
same level of A and B (Yi, — Y;jv.) : Se 


Two B means at the same or 


different levels of C 2[((c - )MSEG)+ MSE(2)| 
(Yu — Gee fF rac 
Two B means at the same 2[((c - )MSEG)+ MSE(2)| 
level of A and C (Yj; — Yivj.) - 


Two A means at the same or 


different level of C 2[(c — I) MSE(3)+ MSE(1)| 
Gua Vue OP Ye Gog) rbc 


Two A means at the same 
or different levels of B and C’ 2[b(c — DM SE(3)+ (6—- 1I)MSEQ)+ MSE) 


(for example, 7,,;; — Yoij. OU Guiz. — Yorr.) rbc 


The plots for the second factor B are constructed in the same manner but are laid 
out perpendicular to the plots for factor A. The levels of the second factor B are 
then randomly assigned to this second array of plots across the same block. A 
sketch of one block for the split-block design is shown in Display 14.2, in which 
there are a = 3 levels of factor A and b = 4 levels of factor B. 
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Display 14.2. Experimental Units for One Block 
of a Split-Block Design 


Experimental Unit 
for Factor A 


Experimental Unit Experimental Unit 
for Factor B for AB Interaction 


Three Sizes of Units and Three Experimental Errors 


The split-block design has three sizes of experimental units where the units for the 
main effects of A and B are equivalent to whole plots, each of different orientation. 
.B interaction effect is a subplot where there is an intersection of 

the two whole sits for the respective levels of A and B. Consequently, a separate 
experimental error must be estimated from the analysis of variance for each of the 
three treatment effects. 

The linear statistical model for the split-block design differs from that for the 
ordinary split-plot design in that it has separate random error terms for each of the 
three treatment effects. A blocking ae px is also fags in the model 


‘i> 


Yijk = UL + Pk + OG oer ver By + din + (Bhs + ee (14.6) 
Beg) OS eae ae b( k= 1,2, 


ae "el, ee Aid. ieee 
The random error effects, dix, gjx, and e;;, are the Baht Bi al errors ‘for the unite : 
associated with A, B, and AB effects, respectively, with variances 0%, a, and o?. 
The analysis of variance for this model along with expected mean squares for fixed 
treatment effects is shown in Table 14.9. 
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Table 14.9 Analysis of variance outline for the split-block design with two 
treatment factors 


‘Seeceoe «Deresq jj Sema =  — feed 
Variation Freedom Squares Mean Square 
Blocks r—| SS Blocks 
A a=i SSA o? + ba4 + rb6? 

~ Error(1) (r — 1)(a— 1) SSE(1) a? + bo? 

B b-1 SSB a; + ao? + ra} 
Error(2) (r — 1)(b— 1) SSE(2) o? + a0? 

AB fa — 1K6—1) peeS' S(AB) a2 +762, 
Error(3) (r — 1)(a — 1)(6—1) SSE@G) o? 


Standard Errors for Treatment Factor Means 


The standard errors for the split-block are more complex than those for the split- 
plot ni Stati because of the altered randomization pattern of factor B treatment lev- 
els. The basic standard errors of the treatment means and treatment differences are 
shown in Table 14.10. 

The degrees of freedom for the standard errors with a single mean square will 
be those associated with the mean square. For those standard errors with two or 
more mean squares it is necessary to obtain degrees of freedom values with the 
Satterthwaite approximation. 


14.9 Additional Information About Split-Plot Designs 


The basic split-plot design and several extensions of the design have been discussed 
in this chapter. Numerous other modifications and applications of the design have 
been presented elsewhere including subunit treatments in a Latin square design, 
confounding of comparisons among subunit treatments, and systematic arrange- 
ment of whole-unit treatments (Cochran & Cox, 1957; Petersen, 1985). Robinson 
(1967) gave a split-plot design and analysis for subplots in a balanced incomplete 
block arrangement and Coons et al. (1989) utilized a split-plot design with whole 
plots in a balanced incomplete block design. The statistical properties of split-plot 
designs with incomplete blocks were given by Mejza and Mejza (1996), and 
properties for the split-block designs with incomplete blocks were shown by Hering 
and Mejza (1997). Federer (1975) included an extensive array of modifications 
along with the connection to series of experiments repeated over locations or time. 
Little and Rubin (1987) and Jarrett (1978) provided detailed relevant discussions on 
the analysis of split-plot designs with missing data, including formulations to obtain 
standard errors for general treatment contrasts. Steel and Torrie (1980) included 
discussions on the analysis of repeated harvest data for perennial crops as split-plot 
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Table 14.10 Standard error estimators for the split-block design 


Treatment Comparison Standard Error Estimator 
2MSE(\ 
Two A means (y,, — Y,,) ease 
: t rb 
2MSE(2 
Two B means (7, — Y,) elas 
ra 


2[(b — I) MSEG) + MSE(1) 


Two A means, same level of B Sg, ; 
. 
2\(a —- 1)MSE3)+ MSE 
Two B means, same level of A (Gj. — Fyv.) 2a — DMSEG) + MSE(2)) 


ra 
Any two means, different levels of A and B (4%, — Yox.) 


2[aM SE(1) + bMSE(2) + (ab — a — b)MSE(3)| 


rab 


designs. Since repeated harvests of the same plot fall under the general category of 
repeated measurements and longitudinal studies, a discussion of that topic is found 
in the next chapter. 


EXERCISES FOR CHAPTER 14 


1. A split-plot experiment was conducted on sorghum with two treatment factors, plant population 

density and hybrid. The whole plots were used for the four levels of plant population density—10, 

15, 25, and 40 plants per meter of row. There were three hybrids randomly allocated to the subplots 

of each plot. The experiment was conducted in a randomized complete block design with four 

replications. The data that follow are the weights of the seed per plant in grams. 

a. Write the linear model for this si ite asc explain the terms, and compute the analysis of 
variance for the data. 

b. Construct a summary table of cell means and marginal means for this experiment, and 
compute the estimated standard errors for the table of means. 
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Head Seed Weight (g) of a Sorghum Trial 
Plants per Meter of Row 
Hybrid Block 10 15 Z2 40) 
TAM 680 1 O07 232 161) 1i2 
2 31.8 444A. WibeeAZ7 
3 320 20 199 14S 
4 a1 344-20.) Toe 


RS 671 I eee 1 TD ae 
2 Ni.0 34.0. 30D. 47a 
> 464 25.6 225° 274 
a $920 15 Ot sla. 250 
15.399 G37 262 205. 169 


] 
x Tx 2536 2 20.2. 46.1. 26.2 . 26.2 
3 448 41.1 30.0 19.2 
+ S93 SGN 24.7. 220) 


Source: Dr. R. Voigt, Department of Plant Sciences, University of 
Arizona. 


c. Compute the estimated standard errors for the differences between two observed means: 
(i) for hybrids 
(ii) for plant populations 
(iii) for two hybrids at the same plant population 

d. Test the hypotheses for interaction and main effects assuming fixed effects for hybrids and 
plant populations. 

e. Compute the relative efficiency of this split-plot design for subplot and whole-plot treatments 
relative to the ordinary randomized complete block design, and interpret it. 

f. Partition the sum of squares for plant population and the interaction sum of squares into the 
appropriate polynomial regression partitions, and interpret the results. Make a graph of the ob- 
served means to assist in the interpretation. The coefficients for linear, quadratic, and cubic 
partitions for the four levels of Plants per Meter are shown below. 


Plants 10 15 25 40 
Linear = 0,546 (20327 0.109 0.764 
Quadratic 0.513, 2 OL up OJ A... 0.399 


Cubic — 0.435 0.783 —0.435 0.087 


2. A split-plot experiment was conducted in a completely randomized design with whole-plot treat- 
ments as a 2 x 2 factorial (factors A and B) and the subplot treatments as three levels of factor C’. 
There were four replications of the experimental units. Assume all treatment effects were fixed. 

a. Write the linear model for the experiment. Identify each of the model components, and show 
the numerical ranges on the subscript. 


EXERCISES 489 


b. Outline the analysis of variance table showing sources of variation, degrees of freedom, and 
expected mean squares. 


Suppose the whole-plot treatments in Exercise 14.2 were arranged in a Latin square design. Repeat 
parts (a) and (b) in that case. 


Suppose the subplot treatments in Exercise 14.2 were a 3 x 3 factorial arrangement of factors C 
and D while all other conditions given were the same. Repeat parts (a) and (b) in that case. 


An investigator in food science wants to conduct an experiment to assess the effect of cold storage 
conditions on food quality. The two treatment factors to be used are storage temperature and con- 
tainer material. The food product will be placed in one of the containers and stored in a temperature 
control chamber for a fixed period of time after which a number of physical and subjective quality 
measures will be taken on the food product in each container. The investigator has three small tem- 
perature control chambers available for the experiment. The storage temperatures for the experi- 
ment are 2°C, 4°C, and 8°C. There are four container types for the study—sealed plastic, sealed 
cardboard covered with wax, sealed cardboard, and an open container as a control. There are four 
positions on the center shelf of the chamber in which the containers can be placed. Draw a diagram 
of a plan for the experiment so that the investigator may have three replications of the experiment. 

Use the following guidelines to construct your diagram. 

a. Use the labels I, II, and III to identify the three temperature chambers and the labels a, 5, c, 
and d to identify the four positions within each chamber. 

b. Show the container type (C1, C2, C3, or C4) assigned to each position in each chamber as well 
as the temperature (2°, 4°, or 8°) assigned to the chamber for all three replications in your 
diagram. | 

c. Show the randomization scheme you used to make the assignments in part (b). 


A research specialist for a large seafood company investigated bacterial growth on oysters and 

mussels subjected to three different storage temperatures. Nine cold storage units were available. 

Three storage units were randomly selected to be used for each of the storage temperatures. Oysters 

and mussels were stored for two weeks in each of the cold storage units. A bacterial count was 

made from a sample of oysters and mussels at the end of two weeks. The logarithm of bacterial 
count for each sample is shown in the table at the end of the exercise. 

a. The investigator could have had three replications for the study by simply taking three random 
samples of each seafood from a single cold storage unit set at one temperature. In this way 
only three cold storage units would have been needed for the study, one for each temperature. 
Explain the potential difficulty with the study if it had been conducted in this way. 

b. Is there a significant increase in bacterial growth as temperature increases? Justify your 
answer. 

c. Is there a significant interaction between type of seafood and increase (if any) of bacterial 
growth with temperature? Justify your answer. 

Write the linear model for your analysis, state the assumptions, and explain the terms. 

e. Determine whether your assumptions about the linear model are correct for this data. 
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Storage Unit Temperature (°C) Seafood* log (count) 


0 l 3.6882 
l 0 2 0.3565 
2 0 ] 1.8275 
Z 0 e E.7023 
3 0 32327 
3 0 i 4.5780 
4 5 l T4950 
4 5 2 5.0169 
5 3 1 9.3224 
, 5 2 T9519 
6 7 7.4195 
6 5 Z 6.3861 
7 10 | 9.7842 
fi 10 2 10.1352 
8 10 l 6.4703 
8 10 2 5.0482 
9 10 l 9.4442 
9 10 2 11.0329 
T= oysters, 2 = mussels 


7. A split-plot experiment in a randomized complete block design evaluated the effects of nitrogen, 
water, and phosphorus rates on the water use efficiency in a drip irrigation culture of sweet corn. 
Two rates of phosphorus (P; = 0 and Pz = 245 lb P,O;/acre) were randomized to whole plots in a 
randomized complete block design. The 3 x 3 factorial treatments of nitrogen (0, 130, and 260 Ib 
N/acre) and water (16, 22, and 28 inches) were randomized to subplots within each of the main 
plots. The data shown are water use efficiency for each subplot. 


a. 


b. 


Write a linear model for the experiment, explain the terms, and conduct the analysis of vari- 
ance for the data. 

Construct a summary table of cell means and marginal means for this experiment, and compute 
the estimated standard errors for the table of means. 

Compute the estimated standard errors for the differences between two observed means: 

(i) for phosphorus rates 

(ii) for water levels 

(iii) for nitrogen rates 

(iv) for water by nitrogen cell means 

Test the hypotheses for all interactions and main effects, and interpret the results. 

Partition the sum of squares for water and nitrogen into linear and quadratic polynomial regres- 
sion partitions including interaction. Interpret the results utilizing a graph of the observed 
means to assist in the interpretation. 


EXERCISES 491 


Block 1 Block 2 
Water Nitrogen Py P» P, P» 
16 0 Sl. 9.7 $0 15:5 
130 36.0 34.2 4331 
260 34.6 34.0 40.7 39.3 
22 0 100° 62 5.1109 
130 oe eee 3 ae 90: 219 
260 30.7 289 208 250i 
28 0 10.6 6.3 45 104 
130 ISA 197 21 TGS 
260 Don 2 19.4 23.2 


nae ee Se a ne SP APN en 
Source: Dr. T. Doerge, Department of Soil and Water Science, University of 
Arizona. 
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Repeated Measures Designs 


The central topic of this chapter concerns repeated measurements of the response 
variable on each experimental unit. Experiments with observations made on suc- 
cessive occasions in time are emphasized. The statistical properties of the observa- 
tions are discussed, and suitable methods for analyzing the data are demonstrated. 


15.1 Studies of Time Trends 


492 


The time trend of individual responses to treatment is an important aspect of many 
experiments. Examples include experiments in which animals are weighed weekly 
to monitor growth under different nutrient conditions or field plots of perennial 
crops such as alfalfa are harvested several times in succession. Repeated ‘measures 
occur r frequently in clinical trials when patients are measured at regular intervals to 
monitor the response to medical treatment. 

Repeated measures on each experimental unit provide information on the time 
trend of the response variable under different treatment conditions. Time trends can 
reveal how quickly the units respond to treatment or how long the treatment effects 
are manifested on the units of the study. Differences in these trends among the 
treatments also can be evaluated. 


Repeated Observations Result in Increased Precision 


Repeated observations on the same experimental unit over time are often a more 
efficient use of resources spon aes use a different Se unit si each 
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Example 15.1 Early Detection of Phlebitis in Amiodarone Therapy 


Treatment Design: An experiment described in detail for Example 2.2 was 
designed to explore mechanisms for early detection of phlebitis during Amio- 
darone therapy. Phlebitis is an inflammation of a vein that occurs upon intra- 
venous administration of drugs. Three intravenous treatments were 
administered to test animals: (1) Amiodarone with a vehicle solution to carry 
the Te (2) vehicle solution es and (3) a saline solution. 


senile Peete’ was prerercm 2 to the ebb Suniel an intravenous 
needle inserted in a vein of 0 one ear. = pes poe abe of both ears was moni- 


3 e : ear was 
é red a possible earl ator of phlebitis “The difference in the tem- 
peratures 0 of the two ears eoenaeeed minus aes was used as the response 
variable. 


Repeated Measurements: The temperatures were observed oe =* minutes 
in eas of ae eats igs the oe of the study. ' The ervations, ma 


A Profile Plot Reveals a Trend 


The observed trends over time for a os treatments in the iin sa —_ are 
_ in Figure 15,1. . s in 3 airs 


> treatment indicates only a fluctuating = ae over tim trend. 

“The ks of ive analysis for this study will be to icin chietiie there 

is a significant upward trend in the temperature for any of the treatments. If so, it 

will be important to determine whether the Amiodarone is responsible for any 
significant temperature increase rather than the vehicle eenion G or the intravenous 


ctdeets: ii shies oa one cece A 
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Table 15.1 Ear temperature differences (°C), treated minus untreated, of rabbits 
at 0, 30,60, and 90 minutes after treatment 


Time of Observation (minutes) 
Treatment Rabbit 0 30 60 90 


Amiodarone | — 0.3 — 0.2 1.2 | 
2 — 0.5 22 ee at 
3 — 1.1 2.4 fe aa 
4 1.0 ‘7 Zl io 
5 — 0.3 0.8 0.6 0.9 
Mean — 0.24 1.38 1.88 2.58 
Vehicle 6 — 1.1 —2.2 0.2 0.3 
7 — 1.4 — 0.2 — 0.5 — 0.1 
8 — 0.1 — 0.1 — 0.5 — 0.3 
9 — 0.2 0.1 — 0.2 0.4 
10 — 0.1 — 0.2 0.7 — 0.3 
Mean — 0.58 — 0.52 — 0.06 0.00 
Saline 11 — 1.8 0.2 0.1 0.6 
2 — 0.5 0.0 1.0 0.5 
13 — 1.0 — 0.3 — 2.1 0.6 
14 , 0.4 0.4 — 0.7 — 0.3 
15 — 0.5 0.9 — 0.4 — 0.3 
Mean — 0.68 0.24 — 0.42 0.22 


Source: G. Ward, Department of Pharmaceutical Sciences, University of Arizona. 


# Amiodarone 
@ Vehicle 
¢ Saline 


Temperature difference 
1 


0 30 60 90 
Minutes 


Figure 15.1 Profile plot of the means for each treatment at each time period for 
the Amiodarone study 
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—— oss sa in Eel ta a 


eel Bee or more a can be administered to each of the sub- 
jects. Suppose athletes are used as subjects for a study on exercise physiology. 
After a training regimen is completed the athletes are given tests on a treadmill set 
in two different positions, horizontal and inclined. The objective is to determine 
whether differences exist between the results of horizontal and inclined treadmill 
tests. Each athlete is a block in a randomized complete block design if the treadmill 

_ tests are administered to the athletes in random order. 


15.2 Relationships Among Repeated Measurements 


The relationships among the observations govern the statistical methods required 
for the particular research design used in a study. The correspondence of the rela- 
tionships to the method of analysis for repeated measures designs is explored in this 
chapter along with some useful strategies for the analysis. 


Correlated Observations Among Repeated Measures 


seumaiede: in time. @ ecervations at 0 and 30 minutes on any one of the rabbits in the 
Amiodarone study are assumed to have a larger correlation than observations at 0 
and 90 minutes. nee 


The correlation between ine Variable’, say y; and yo, is defined as 


pics e3 hes sche are the standard deviations of y; and y2 and ont 


sheoncitoal variances oe covariances for repeated measures taken ens as 
Y15 Y2, y3, and yq are illustrated in Display 15.1 as the 4 x 4 » matrix. 


Analysis of Variance Assumptions 
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Display 15.1 atrix of Variances and Covariances 
for Four Repeated Measures 


shown in aes ae 1. > value for all 


Compound Symmetry Means Equal Correlation Among Repeated Measures 


A particular experiment with randomization of treatments to experimental units 


is only a random ae ws all possible bantonizes is pecs that could have 
been used. e ¢ of ra Jom ization - aoes not e 7e th ~ 


+} 


=e ren en 


eee eee ons ¢ i 


The matrix oe variances ett covariances with coursalne Pimnetry 3 is shown in 


Display 15.2. 


Display 15.2 


Matrix of Variances and Covariances of Four 
Repeated Measures Under Compound Symmetry 


Y¥3 Y4 


Split-Plot Treatments Are Randomized; Repeated Measures Are Not 
Randomized 


The assumption of compound symmetry was used for the errors of observation 
in the split-plot experiment in i 14 because treatments \ were aint as- 
signed to the subplots. The subject in the repeate es design | 
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quivaient to the w lot treatmer : lot de 1. The repeated 
measure on a a subject is Ne to the Sioplot in ithe splitplot sa an F Tr 


tor. Fie-split-plot analysis of variance was echibagd igeChapisr 14 for various 
between-subject, or whole-plot, experiment designs. 
The Huynh—Feldt Condition Less Stringent than Compound Symmetry 


Huynh and Feldt (1970) showed that conditions required for the usual analysis 
of variance for repeated measures Gens were less stringent = the a 


Symes condiuon, : | sam Z 
Ce “Ce TOr all POSSI1D. > ps ITS OF ODSEI at 5 a Nn at dui ferel = wis e 
y yi and y Bor fue polwf rt» Hat the variance 
noice of the d; erence 
or 7 : 
j of tay owls 
for some > 0. This condition also can be stated as (5 Cowsfa 
\ fori # j (15.3) 


ndition is known as the 
HO x. The mean squares from an analysis of variance can be used to test 
: ngnetieses about the within- -subjects treatments if the Huynh—Feldt condition is 
satisfied. 


Ci &) are 


If Each Subject Receives All Treatments 


‘fal unit. For example, the stitidembie cost of fhaiiaining asi cans 
makes it eueeiant to obtain as much information about the treatments as possible 
from the individual animals. Also, the variability of observations among animals 
tends to be much larger than that among multiple observations on the same animal. 
Thus, blocking on animals with all treatments administered to each animal in- 
creases the precision of treatment comparisons. 

When each of the treatments is administered in random order to each subject, 
for example, B+D-A-C, then subjects are random blocks in a randomized com- 
plete block design. The expected mean squares for the randomized complete block 
design are shown in Table 15.2. lt is a aan ie model pees with Sasiseaies = 
si sas treatment effects. The statistic, Fy = MST, null F 
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Table 15.2 Expected mean squares for the randomized complete block mixed- 
model analysis of variance 


Source of Degrees of Mean Expected 
Variation Freedom Square Mean Squares 
Blocks r—1 MSB a” + te; 
Treatments t-i Moti, o' +70? 
Error é.— Dr) MSE ~= — @? 


Carryover Effects 


effects can Paerously bias the eonmaigs Port reamient t means pecsuse treatments 
administered in previous periods influence the effects of the treatments | in succeed- 
. Ing periods. 
daniel ies we Carryover effects are sadiculiets troublesome with animal and human subjects 
“S855. With successive administration of dietary or "medical treatments that affect = 


: : 
ee a 


i of the oi: 


SOA SES 


adie are the — of a 16. 


15.3 A Test for the Huynh—Feldt Assumption 


A univariate analysis of variance can be used under any of the three alternate sets 
of assumptions sss the hes sees measures discussed i in oe 15.2. They were 


Type H H matrix can wee evaluated evils a test ¢ attributed —t : (1940). The test is 
demonstrated in this section. 


h the Amiodarone study of Example 15.1 PRetall the xen ne idee 
treatments iam to pune neato units in a ‘completely randomized design, and 
the —— units \ were measured on four successive occasions. 


for ©, is neinitetie by sie shambiide ce tate ac programs for the 
sivlineis of repeated measures designs. A brief outline on the details of the test sta- 
tistic is given in Appendix 15A.1. 
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The result of the Mauchly W test for the ——— study computed seis a sta- 
ech Eee is sans in aa 15. 2. | 0.857 


dom Fsshste s pis vthe number of repeated m measures. s Fo or the Amiodarone 


| “loa ¥=4G)/2 - | aa52 


Table 15.3. Results of the Mauchly test for a Type H & matrix 


y Tene aT «sé rae 
Phe a ee 

Cov. meahriK does _ Chi-square approx. = 1.72 with 5 D-F. 

pot re HF shauct P(xg > 1.72) = 886 & Eat rs | 


MAuG ai made. null hypothesis is is not very void unless the experiments have a large number of 
replications. Boik (1981) indicated power of less than .20 for some specific cases of 
the Mauchly test when there were three treatment groups and as many as 12 sub- 
jects per group. Consequently, complete reliance on = test i not ¢ oc 


15.4 A Univariate Analysis of Variance for Repeated 
Measures | 


If we can reasonably assure ourselves that the analysis of variance a ssumptions are 
valid for repeated measures on each rabbit in the treatment groups, the split-plot 


analysis of variance mean squares can be used to test i abor t the treat- 
ment means and their interactions oa time. ~The 


Agee SIA 


The 
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(15.4) 


where pz is the general mean, a; is the effect of the zth treatment, d;, is the random 
experimental error for rabbits within treatments with variance 03, (3; is the effect of 
the jth time, (af),; is the interaction between treatments and time, and e;;, 1s the 
normally distributed random experimental error on repeated measures with vari- 
ance a2. The split-plot analysis of variance for the data from the Amiodarone study 
is shown in Table 15.4. The split-plot analysis of variance was exhibited in Chapter 
14. 


Table 15.4 Split-plot analysis of variance for repeated measures from the 
Amiodarone study 1 in a completely randomized design 


Source of .°. Degreesof Sumof Meanssiearie “oy eke 
Variation Freedom Squares Square eee ee 
Total 59 93.28 [SS SS 
Treatment (A) 2 35.38 17.69 i944 SY M00. 
Error(1) 12 10.94 0.91 ee 
Time 3 16.08 5.36 9.24 0.000 
A x Time 6 10.06 1.68 2.90 0.021 
Error(2) 36 20.82 0.58 


Use Regression Contrasts on Repeated Measures to Study Time Trends 


The global test for significance of the interaction between treatments ge time a 


- The interaction should - sonra as a aicieuciawe i in yee over eae 


among the treatments. 

The observed trends over time for the three treatments in the Amiodarone 
study were shown in Figure 15.1. The profile plots exhibited an increase in the ob- 
served temperature differences for the rabbits in the Amiodarone treatment. Less 
definite trends were observed with the vehicle and saline treatments. 
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the analysis of variance have been illustrated several times in previous chapters (for 
example, see Example 6.3). 

Linear, quadratic, and cubic regression sum of squares partitions can be com- 
puted for time with corresponding partitions for the treatment x time interaction. 
The sum of squares partitions for the Amiodarone study are shown in Table 15.5. 
The polynomial regression sum of squares partitions for the split-plot analysis of 
variance were exhibited in Example 14.1. 


Table 15.5 Split-plot analysis of variance with polynomial contrasts for repeated 
measures from the Amiodarone study 


Source of Degrees of Sum of Mean 
Variation Freedom —_ Squares Square | eae 2 a 
Total 59 93.28 
Treatment (A) p: 35.38 17.69 19.44 0.000 
Error(1) 12 —10.94 0.91 
Time (T) 3 16.08 5.36 9,24 0.000 
T linear l 14,52 14.52 25.03 — 0.000 
T quadratic l 0.60 0.60 1.03 0.315 
T cubic l 0.96 2 6 1.66 0.205 
ALK T 6 10.06 1.68 2.90 0.021 
4 x T lineat 2 7.80 3.90 6.72 . MOOS: 
A T ee 2 0.56 0.28 0.48 0.622 
A x T cubic 2 i 0.85 1.47 0.242 
Error(2) 36 20.82 0.58 


It was determined previously that interaction between the treatments and time 
was significant, but the hypotheses tested with the mean squares in Table 15.5 pro- 
vide more specific information about the form of the ls sdeamma ees inser 
Sais i escion dk and linear eee on time, S Significant with © 


treatment ups. The linear aie Qi(1), are Sc ianieeel for each of the treatment 
groups 08 computing the linear contrast among the time means for each treatment, 
where aj,) is the linear contrast for treatment 7. For example, the means for the 
Amiodarone treatment group at 0, 30, 60, and 90 minutes were — 0.24, 1.38, 
1.88, and 2.58 in Table 15.1. The linear contrast for the Amiodarone treatment 
group is eee: 
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5 = C3X0.24) + C1K1-38) + (10.88) + ESE) 
‘e ee) te ae Ue ee oe 


8.96 
ie Page 


= 0.45 


with standard error 


The linear contrasts for the vehicle and saline treatments are ag) = 0.11 and 
a3(1) = 0.10, wate shes “ <cael error See = 0. . 


The Bonkefoni t na 36 sabes of Peden since the mean square hap error, 
MSE(2) = 0.58 from Table 15.5, has 36 degrees of freedom. The 95% SCI are 
computed 2 as Selle = 2. abn nia 


15.5 Analysis When Univariate Analysis Assumptions Do 
Not Hold 


When the Huynh—Feldt condition is not satisfied for the repeated measures the re- 
sults from the univariate analysis illustrated in Section 15.4 are not valid. Several 
alternative analyses are suggested when the usual analysis of variance cannot be 
used. 


Three Choices for the Analysis 


epi but general multi- 
variate methods are beyond the intent of this book and their direct use is not con- 


sidered here. 


statistics fro: é ariance to brcier apenas the ie eheance jack 
of he tests. Vis inaeee all be illustrated in this section. A third alternative — 
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> repeated measures. The annbysis of Ceiaiaats | uses eteituses of maliveents analy- 
sis etic can 77 aeliod3 in a straightforward manner to repeated measures. The study 
described in Example 15.2 resulted in repeated observations on the experimental 
units for which the Huynh—Feldt assumptions were not valid. 


Example 15.2 Soil Moisture and Soil Microbe Activity 


A productive agricultural soil requires a certain level of soil aeration to main- 
tain active plant root growth and soil microbial activity. A soil scientist found 
that soil aeration levels had been affected in soils fertilized with the nutrient 
rich sludge by-product from a sewage treatment plant. The aeration level of 
the soil can be reduced by the high water content of the added sludge soil; 
through compaction by heavy machinery used to add the sludge; and, ironi- 
cally, by the increased microbial activity caused by adding the high-organic 
sludge material to the soil. 


Research Objective: One objective for this particular study was to determine 
moisture levels at which soil aeration became limiting to microbial activity in 
soils. 


Treatuvent anil eas treatments ce a control soil treatment with no 


Patcehe Soil, 


re 0.24 "026 and 0.28 kg 


Cc oi sisal pe dace oe soil were randomly assigned | 
catments in a completely randomized desig the qeged a sani! were 


aieea” in Pattight containers aad iidibated ander conditions conducive to 
microbial activity. The soil was compacted in the containers to the degree ex- 
perienced in the field. 

Microbial activity, pion le as saci evolution, was used as a measure of 


fainer on days 2. fe 6, anda ates thie seenntae of itie a hatch oneae 
The acrebial activity in each soil sample was recorded as the percent in- 
crease in CO, produced above atmospheric levels. The data are shown in 
Table 15.6. 


The Univariate Analysis Assumption Is Not Valid 


The Mauchly test of whether the Huynh—Feldt condition holds for the repeated 
CO, measurements is shown in Table 15.7. The Mauchly statistic is W = 0.180, 
and the chi-square distribution APRIe NAOT to ag Hea: 2 se =e — ee —] 
ates sec ole mie Baers a Peadon, Che onifican evel for the test is 
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Table 15.6 Repeated measures on CO» evelnizat from microbial activity in soil 
under different moisture conditions tiay as 


% CO , Evolution/kg Soil/Day 


(Kg water/Kg soil) 

Moisture Container Day2 Day4 Day6 Days 
Control l C2 O56 C66 (89% 

. 0.68 0.91 1.06 0.80 

3 0.68 -045 0.72 . 0.89 

Mean 0.53 ~064- 0.81 0.86 

0.24 4 2153 “ye 240 1.50 

5 L297 oe L3a O.74 

6 0:36. aie tes 21 

Mean Loo ee fy sD 

0.26 7 O22 =e Oe Oli 

8 0.45 = 028 1.24 . 0.86 

9 ‘0,22. G33 9 U20 

Mean 0.30 0m28 2 O39 - 0,39 

0.28 10 0.225 .A80- O50 —§U0.3/ 

1] O22 =3062- Geo 0.95 

bg 0.22 03632 069 . 0.63 

Mean O22... 1.00. 0.79 UGS 


Source: Dr. I. Pepper and J. Neilson, Department of Soil and Water Science, University of Arizona. 


Table 15.7 Results of the Mauchly test for the Huynh—Feldt condition for CO, 
measurements on soil samples . 


Mauchly Sphericity Test 
W = U.160 
Chi-square approx. = 11.52 with 5 D.F. 
Pye > 11,52) Of 


The Errors of Inference are Compromised Ems 


Given a significant Mauchly test we can assume —_ from the’ univariate 
reste of pee Sa) ee eee 01k (1981) showed that very small depar 


ment in Phe case of the current study. One éf ii earliest compromises to the 
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univariate analysis of variance was to adjust the values of the computed Fp statistics 
in the analysis. 


Adjustments to Univariate Test ara 


AaAc nn 


(Box, 1954a, 1954b) ‘Greenhouse and G Geisser 1959) suggest Ee — maser ae — - 
Doses on the seat sci Box; where numerator and d : 


ficant, T .. The eiealaion of t the? E s bieuenta is ichowane iapeendix 15A.2, mee it is 
comme in most statistical programs for repeated measures analysis. 
Huynh and Feldt (1976) — a less conservative Baas ia at the 


stment. The Huvole-Felds 2 € ad- 
justment i is computed en most programs for Sener measures analysis, and the 
€ ps hain eo nor is shew in Appendix aa aoe _ 

ee Oe  e - 2 lit-plot analysis of variance Fo sta- 
3 2d measures es: . No dinates are necessary for tests 
bait wholes sot or i cinemas aibaed treatment factors since the treatments are 
randomly assigned to the experimental units. The split-plot analysis of variance for 


the repeated measures is shown in Table 15.8 with Moisture treatments as the 
whole-plot factor and Day as the subplot factor. 


Table 15.8 Analysis of variance for CO. measurements on soil samples (SAS- 


GLM) 
es ty 
aN Source of Degrees of Sumof Mean G-G IEF 
| Variation Freedom Squares Square fe PPS FF Pr > F- Pr> Ff 
cig yy Moisture 3 [ise 8S E15" 8.003 
\ Error(1) oS se 0.35 
é This bsed | day 3 pag es 9.524 0.317 0.324 
bv BUY Ati Day x Moisture 9 ks) G17 2. U:296 *0330 0.304 
Error(2) es 0.13 
Makirg Lhic 
Me ty) od “Significance level after adjustment with Greenhouse-Geisser epsilon = 0.5245 Not Hoe 
ihe pee re aes level after adjustment with Huynh—Feldt epsilon = 0.8755 
WA! COM PA 


70 phe opher et The Greenhouse-Geisser @ ade shown at ee bottom of an ae a is 
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The Greenhouse—Geisser € adjustments to the numerator and denominator de- 
grees of freedom for the test of interaction between moisture levels and days, 
Day x Moisture, are 9(.5245) = 4.7 and 24(.5245) = 12.6, respectively. With the 
adjusted degrees of freedom the probability of exceeding f= = 26 is 
— > pies = sini a the oe sata was .296 without the adjustment 
Pon ee of th stment Is t sase the P-Value wis the ef statistic to a 


canaeieeee é Huyni-Feldt aanuitoens result in n 9A 87 755) = = of 9 ad 24(. 8755) = = 21 
degrees of freedom for the test with Pr > F' = .304. 


cabo the interaction sesssvone moisture levels and prince was not Snap ct 


In aiken case the main effects for days are not significant. 
The F test results for main effects of moisture treatments is the usual Fo for 
whole- -plot main effects in the ego analysis of v variance. The calculated value 


F = .003, and the null hy- 
ar ” g ‘ee soil moisture 


Tests among the some moteniee treatment means require standard errors based 
on the whole-plot equivalent error mean square, MS E(1) = 0.35, in Table 15.8. 
For example, the standard error of the difference between two means is 


\/2(0.35)/(3)(4) = 0.24, given r = 3 replications and p = 4 repeated measures for 
each main effect mean. 


Contrasts on the Repeated Measures Provide Specific Inferences 


tions of enteral canéequenes usually involve interesting contrasts anene the treat- 
ment means. An alternative analysis for repeated measures is based on important 
contrasts among the repeated measures. The analysis provides the appropriate test 
statistics from multivariate methods while using the familiar univariate analysis of 
variance. 


The analysis requires the calculation of a contrast among the repeated mea- 
sures for each experimental unit. The values of the contrast are used as if they were 
the original observations on the experimental units, and a univariate analysis of 
variance is computed from the observed contrast values. 


Polynomial Regression Contrasts to Study Time Trends 


ymial re pression contrasts are the most useful statistics | 
yestigating th d over time. For example, the linear contrast for CO, evo- 
lution’ over time in the <i moisture study is calculated for each soil sample as 


15.5 ANALYSIS WHEN UNIVARIATE ANALYSIS ASSUMPTIONS DO NOT.HOLD 507 


2451) = (— 3) ijt) + C— Diz) + CO) yig3) + 3) uiza) 


The linear model for a contrast c is 


- Tie) + €ix(e) (15.5) 


where Z;,-) is the contrast value for the jth experimental unit on the 7th treatment, 
lc is the general mean for the contrast, 7;(,) is the treatment effect for the contrast, 
and e;,-) is the normally distributed random experimental error for the contrast 
with variance o2.. 

The contrasts of interest for the current study are the linear, quadratic, and cu- 
bic polynomial contrasts computed from the repeated observations on each of the 
soil containers found in Table 15.6. The resulting contrast values for each container 
are shown in Table 15.9. For example, the linear contrast for the first container in 
the control treatment is 211) = ( — 3)(0.22) + (— 1)(0.56) + (1)(0.66) + (3)(0.89) 

= 2.11. PA Z 

Cov | Day2 Cou) Day Cowl, Dy6 cory days 
Table 15.9 Linear, quadratic, and cubic contrasts for three soil samples from each 
of four treatment groups in the CO2 evolution study 


Contrasts 
Treatment Unit Linear Quadratic Cubic 
Control 1] a 0.37 
2 0.51 — 0.49 — 0.33 
3 0.90 0.40 — 0.60 
Mean atch — 0.07 — 0.19 
0.24 4 —3.69 — 0.77 0.77 
5 —5.63 0.55 — 1.61 
6 24) — 1.47 — 0.85 
Mean —2.29 — 0.56 — 0.56 
0.26 7 —0.35 — 0.09 — 0.05 
8 2.19 — 0.21 — 2.47 
9 —0.05 — 0.25 — 0.05 
Mean 0.60 — 0.18 — 0.86 
0.28 10 0.45 — 1.01 0.15 
11 2.46 — 0.34 — 0.08 
iz 1.36 — 0.40 0.02 
Mean 1.42 — 0.58 0.03 


Linear(P;;),(— 3, — 1,1,3); quadratic(P,;),(1, — 1, — 1,1); cubic(P3;)(— 1,3, — 3,1) 
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A Separate Analysis for Each Contrast 


The analyses of variance computed for the linear, quadratic, and cubic con- 
trasts are shown in Table i= 10. bbs oo of variance follows the form used in 
Example 15.1. Pe ontras were computed for the day factor along with 

| dheir-interactiontai ont isture fact 


Table 15.10 Analyses of variance for polynomial contrasts on day of CO2 mea- 
surements from soil samples 


Source of Degrees of Sumof Mean 

Variation Freedom Squares Square F ProP 
Day linear l 0.031  Wesr 0.11 0.744 

Day linear x Moisture 3 1.320 0.440 1.64 0.255 

Error _ | 2.1435 ee 


Day quadratic 0.366 0.366. , 4.19 0.075 
Day quadratic x Moisture 3 0.156 0.052 0.60 0.635 


| Error 0.698 0.087 
dP heve | 
aw alyséS 
are jpazt) on Day cubic 1 6.093 Qe ae 9,193 
BAF 'S, This Day cubic x Moisture 3 0.070 0.023 0.50 0.690 
makes them Error () 0.369 0.046 


Move’ Coarse atl ve. 
The primary difference from the analysis for Example 15.1 is the computation 


of separate error sums of —— for ose of the contiess des their interaction with 
moisture. res ¢ rimental error ce from the “Error" sour ce of © 


and cubic ‘contrasts ie Tespectively. — siisical anne oe ee can produce 
thee analyses for repeated measures designs. 


The Test for Interaction Between Treatments and Trends 


ture + teetinenis. ‘The Fi statistic tests the tolled: Ao: Tic) = 9 for all Moisture 
treatments. 

The test statistic for no interaction between the linear contrast of time and 
Moisture treatments in the analysis of variance for Day linear, Fo = MS(Day lin- 
ear x Moisture)/s2, = 0.440/0.268 = 1.64, and with Pr > F = .255 we conclude 
there is no interaction. Likewise, the Fo statistics for Day quadratic and Day cubic 
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interaction with Moisture treatments test the hypotheses of no interaction between 
the Paiste ¢ treatments and the : Seas ane ibs contrasts. 
ice none of the interactions were signifi | test hypotheses concern- 


nd in “COy evention t over time. The mean squares for 
each of the contrasts are used to test these aupatheses about the main effect of time. 

The statistic, Fy = MS(Day linear)/s?, = 0.031/0.268 = 0.11 tests the hy- 
pothesis that the linear contrast is zero rece treatments, and the hypothesis is not 
rejected with Pr > F = .744. Likewise, the Fo statistics for Day quadratic and Day 
cubic test the hypotheses that the quadratic and cubic contrasts are equal to zero for 
all sicamaiie, and neither contrast is Sepaeanit 


ing SHE ex sieie Of aT 


evolutic m COO RIT aneteeaerine tie fee cit days of PS HRUBREOT er peri- 
ea Fue oe the analysis of variance in Table 15.8 with a significa 
main effect indicated the average levels of CO2 differed among the soil treatments. 


The Analysis of Individual Contrasts Are More Conservative 


The gic oa of the individual contrasts are not Vig aeae free of drawbacks. The 
tests based on the analysis of the individ ntras 10 

ae cant TE powerful heatr-thowe 2m on a usual split-ptot analysis in 
Table 15.8. The F' tests for the contrasts are based on error variances with 8 de- 
grees of freedom in Table 15.10, whereas tests for the same contrasts from the 
split-plot analysis were based on an error variance with 24 BRS of freedom in 
Table IS. 8. + eae the Greenhouse—Geisser or Huynh—Fe fments 


Contrasts May Have Different Experimental Error Variances 


The analysis based on individual contrasts illuminates a common occurrence in 
the analysis of data wherein the error variances associated with different contrasts 
can be quite disparate. The disparity in the error variances for polynomial contrasts 
in the soil microbe study can be seen in Table 15.10. The individual ahpeee 
error mean squares for the linear, quadratic, and cubic contrasts were s?, = 0.268, 

s?, = 0.087, and s?, = 0.046. Thus, a threefold difference exists between the error 
variances of the linear and quadratic contrasts ane almost a sixfold er be- 
tween the linear and cubic contrasts. The error var he unive 
—_* Table 15.8, MSE = 0.13, is 

ntrasts. In certain settings the disparity between the error variances for the sane 
-ieal contrasts and their pooled value in the anes moh pals of variance can lead 
to contradictory conclusions. Therefore, it is recommended that the potential. 
parity among error variances eer a ecoup BBE ponies vet avast Tian arene 
study. 


e average eee variance 1 8 te" ives 
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15.6 Other Experiments with Repeated Measures Properties 


An analogy has been made in this chapter between the repeated measures on an 
experimental unit and the subplots in a split-plot experiment. Several features of the 
subplot in a traditional split-plot design distinguish it from a repeated measure. The 
subplots are usually distinct experimental units of a smaller size than the whole- 
plots. The subplot treatments are randomly assigned to the distinct subplot experi- 
mental units. These two properties of subplots allow a reasonable assumption that 
the variance and covariance structure of the observations are compatible with the 
requirements for the traditional analysis of variance. | 

The subplots in a traditional split-plot design ordinarily represent spatial vari- 
ability as opposed to the time variability associated with repeated measures. The 
covariance relationships between the observations on the subplot treatments would 
be similar to those for repeated measures in time if the subunit treatments are not 
randomly assigned to the subplots. Randomization is not possible in certain types of 
split-plot experiments that have a spatial distribution of subunits. 


Repeated Measures in Space 


Consider a study for which an animal physiologist had hypothesized that size of tis- 
sue segment would affect the results of an assay for tyrosine concentration. Four 
segments of different size were taken from the same diaphragm muscle of an indi- 
vidual animal, and the tyrosine assay was conducted for each segment. 

The four segments of different size from a single diaphragm muscle represent 
within-subject treatments where the individual diaphragm muscles are the subjects. 
The experiment design is a complete block design with diaphragm muscles as com- 
plete blocks and the muscle segment sizes as treatments. 

The four segments can be taken from random locations of the diaphragm mus- 
cle but the spatial relationships among the segments are unknown. If there are dif- 
ferent. correlations between the segments of different sizes, then the usual 
assumptions for the randomized complete block design are not appropriate and a 
repeated measures analysis should be considered for the data. 


Gradient Treatment Designs 


Experiments with a gradient treatment design for the subplots illustrate a second 
type of a split-plot experiment without randomization of treatments to the subplots. 
Examples include experiments with sprinkler irrigation for agronomic crops used to 
create a gradient treatment design. Typically, the objective of the experiment is to 
ascertain the drought tolerance properties of several crop cultivars. The cultivars 
are randomly assigned to field plots in a randomized complete block design. The 
design is illustrated in Figure 15.2 for an experiment with five cultivars in each of 
two complete blocks. 

A line sprinkler irrigation system is placed between the two blocks of cultivar 
plots. The sprinklers on the system can be adjusted to emit a high amount of water 
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Block 1 Block 2 
Irrigation Gradient Irrigation Gradient 
nr entra men neem sree enn nnn armament nina mean 
Low Medium High High Medium Low 
Cultivar Cultivar 

A D 
C B 
B E 
D A 
E © 

ow Medium High igh Medium 

Line Cl ee 


Figure 15.2 Illustration of a gradient treatment design for crop cultivars 


close to the sprinkler line and a lesser amount of water in a gradient away from the 
sprinkler line. Consequently, a water gradient treatment is established along the 
length of each cultivar plot. Crop yields and other measurements are taken from 
subplots established on the irrigation gradient of each cultivar plot. 

Three subplots are shown for each cultivar whole plot in Figure 15.2. The sub- 
plot treatments shown as “high,” “medium,” and “low” are the amounts of water 
applied to the subplots. The water levels are not randomly assigned to the subplots. 

The usual split-plot design assumptions may not be appropriate for the obser- 
vations on the subplots in the absence of randomization. It is more appropriate to 
consider the observations as repeated measures observations and proceed with an 
analysis of the data using the methods outlined in previous sections of this chapter. 

The F test for the main effects of the cultivars will be valid in the whole-plot 
analysis of variance provided the cultivars are randomly assigned to the whole- 
plots in each block. The research question of initial interest in these studies relates 
to the differential performance of the cultivars over the water gradient treatment. 
The statistical test of cultivar x water-level interaction from the within-subjects 
subplot analysis will address that hypothesis. 


15.7 Other Models for Correlation Among Repeated Measures 


The most general structure for correlation among repeated measures is that shown 
in Display 15.1 in which all variances and covariances, with o;; = o;;, have the po- 
tential to be unique. Two versions based on assumptions that simplify the structure 
are either compound symmetry (Display 15.2), or the Huynh—Feldt conditions for 
equality of variances between all differences on repeated measures (Equation 15.2). 
Under these conditions a straightforward split-plot analysis of variance will provide 
valid inference. 
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Derived Variables Analysis When Analysis of Variance Assumptions Do Not 
Hold 


Highly structured laboratory or field experiments such as the Amiodarone study in 
Example 15.1 or the soil microbe study in Example 15.2 are often effectively ana- 
lyzed with the analysis used for the soil microbe study. That type of analysis may 
be considered an analysis of derived variables, which in Example 15.2 were 
orthogonal polynomial contrasts derived from the repeated measures over time. 

Other variables can be derived from the repeated observations on each unit, de- 
pending on the nature of the study. For example, if the repeated measures reflect 
growth or wear response it may be quite reasonable to derive variables based on a 
nonlinear growth or wear model. In those cases, the derived variables would be 
parameters such as slope, asymptote, point of inflection, and so forth. 

The advantage of this approach is its simplicity through inference by standard 
univariate analysis of variance. One of the disadvantages is that the derived vari- 
ables are not necessarily independent; thus, the inferences for the derived variables 
are not independent of one another. The derived variable approach breaks down in 
studies with incomplete data because of missing values at some points in time, or in 
less structured studies in which subjects may be measured at different points in 
time. Under these circumstances the common variance assumption for standard 
analysis of variance methods is no longer valid. 


Models with Reduced Number of Correlation Parameters 


Multivariate models can be used for inference with the general correlation structure 
shown in Display 15.1. However, the model may include an unneccessarily large 
number of variances and covariances for estimation. Simpler structures have been 
proposed that have fewer covariance parameters to estimate. They also are not as 
restrictive as compound symmetry or the Huynh—Feldt condition and can be more 
representative of the correlation behavior. 

Two such models are the serial correlation and the random coefficients mod- 
els. The serial correlation model has errors correlated within subjects or units de- 
fined by the relationship 


Cee; 


oe baa gz 


where pyz,-z,) is the correlation between errors at times 7 and 7 and o¢,¢, is the co- 
variance between errors at times 7 and 7 on the same subject. 

Correlation arises among the repeated measures in the random coefficients 
model as a consequence of the assumption that the treatment effects or regression 
coefficients vary across the subjects or experimental units. The simplest example is 
when the intercept of the time response profile varies between the units because 
some units are intrinsically low responders while others are high responders. 

The model has subject- or unit-specific treatment effects as a result of this as- 
sumption. The random coefficient models are most useful when it is desired to 
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make inferences about individuals rather than population averages, since the model 
coefficients for each individual can be estimated for prediction purposes. 

Statistical estimation methods for the alternative correlation structures are 
based primarily on maximum likelihood methods and are beyond the scope of this 
book. Diggle, Liang, and Zeger (1994) provide detailed discussions of the models 
and estimation methods available for studies with repeated measurements. 


EXERCISES FOR CHAPTER 15 


1. A study was conducted on human subjects to measure the effects of three foods on serum glucose 
levels. Each of the three foods was randomly assigned to four subjects. The serum glucose was 
measured for each of the subjects at 15,30, and 45 minutes after the food was ingested. The data 
are shown in the table. | 


Time (minutes) 
Diet Subject 15 30 45 


] 1 28 34 32 
2 15 Fie 27 
3 12 33 28 
4 21 44, 39 
2 5 an 18 12 
6 23 pe 10 
7 16 9 
8 4) 24 15 
3 9 31 30 be 
10 28 27 36 
11 24 26 36 
12 a4 26 a2 


a. Describe the study in terms of the between-subjects and within-subjects designs. 

b. Compute the mean of the observations for each diet at each time of measurement, and make a 
profile plot of the results for each treatment. 

c. Write the linear model for a split-plot analysis of variance, identify the terms, and indicate the 
assumptions necessary for an analysis of the data. 

d. Conduct the split-plot analysis for the data, test the necessary hypotheses, and compute treat- 
ment means and their standard errors. What are your conclusions? 

e. Obtain the residual plots from the split-plot analysis, and interpret them. 

f. Compute the sum of squares partitions for the linear and quadratic contrasts on time and their 
interactions with diet, test the null hypotheses, and interpret the results. 

g. If a repeated measures analysis computer program is available, test the hypothesis that the 
Huynh-Feldt condition can be assumed for the © matrix of the experimental errors for within 
subject variances and covariances. What is your conclusion? 
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h. Suppose it is necessary to conduct a separate analysis for the linear and quadratic regression 
contrasts. Write the linear model for one of the contrasts, identify the terms, and indicate the 
necessary assumptions for the model. Compute the analyses for the contrasts, and interpret the 
results. Are the experimental error variances comparable for the contrasts? Do the results differ 
from those in part (f )? Explain. 


2. A soil scientist conducted an experiment to evaluate the effects of soil compaction and soil mois- 
ture on the activity of soil microbes. Reduced levels of microbe activity will occur in poorly aer- 
ated soils. The aeration levels can be restricted in highly saturated or compacted soils. Treated soil 
samples were placed in airtight containers and incubated under conditions conducive to microbial 
activity. The microbe activity in each soil sample was measured as the percent increase in CO pro- 
duced above atmospheric levels. 

The treatment design was a 3 x3 factorial with three levels of soil compaction (bulk 
density = mg soil/m®) and three levels of soil moisture (kg water/kg soil). There were two replicate 
soil container units prepared for each treatment. 

The CO, evolution/kg soil/day was recorded on three successive days. The data for each soil 
container unit are shown in the table. 


| Day 
Density Moisture Unit ] 2 3 
= 0.10 2.20 0.34 0.11 


2 2.90 bo7 125 
0.20 3 5.20 5.04 3.10 

“ eee Be Pe 
0.24 5 4.00 3.47 3.47 

6 

q 

8 


410 347 2.46 


1.4 0.10 2.60 1.12. 090 
ou OTS: 

0.20 9 a0. 3.326. he 

10 | rs 20 

0.24 1] Loe... 2 ge 

12 fn eee eB 2.69 

1.6 0.10 13 bi tins BOL a Vide 
14 AO O98 ero O22 

0.20 3 sew ies ZOO 202 

16 260s: Sit 1246 

0.24 17 L300 21698 %) 2346 

18 O50 § 0.34% 0,00 


Source: Dr. I. Pepper and J. Neilson, Department of Soil and Water 
Science, University of Arizona. 


a. Describe the study in terms of the between-subjects and within-subjects designs. 
b. Compute the mean of the observations for each soil bulk density and soil moisture level at 
each time of measurement, and make a profile plot of the results for each treatment. 
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c. Write the linear model for a split-plot analysis of variance, identify the terms, and indicate the 
assumptions necessary for the analysis with this model. 

d. Conduct the split-plot analysis for the data, test the necessary hypotheses, and compute treat- 
ment means and their standard errors. What are your conclusions? 

e. Obtain the residual plots from the split-plot analysis, and interpret them. 

f. Compute the sum of squares partitions for the linear, quadratic, and cubic contrasts on time 
and their interactions with density and moisture treatments; test the null hypotheses, and inter- 
pret the results. 

g. If a repeated measures analysis computer program is available, test the hypothesis that the 
Huynh-—Feldt condition can be assumed for the 4: matrix of the experimental errors for within- 
subject variances and covariances. What is your conclusion? 

h. Apply the Greenhouse—Geisser adjustment to the F' tests in part (d). Do the conclusions differ? 
Explain. 

i. Conduct a separate analysis for the linear and quadratic regression contrasts. Write the linear 
model for one of the contrasts, identify the terms, and indicate the necessary assumptions for 
the model. Compute the analyses for the contrasts, and interpret the results. Are the ex- 
perimental error variances comparable for the contrasts? Do the results differ from those in 
part (f)? Explain. 


The fabric of athletic clothing may change the skin’s hydration state because the fabric serves as a 
barrier to the dissipation of body-generated water. A textile scientist conducted a study to evaluate 
the effect of fiber type and fabric moisture content on evaporative water loss from the skin. 

Five male subjects were used for the study. Each subject was used to evaluate five fabric treat- 
ments. The test fabrics were cotton and polyester fabrics commonly used in athletic clothing. The 
test was conducted by placing a piece of fabric directly on the subject’s forearm skin surface. An 
instrument was used to measure the amount of water that evaporated from the subject’s skin sur- 
face. The test was conducted in a controlled environment room at 70° F and 65% relative humidity. 

The fabric treatments were (1) cotton at equilibrium, (2) cotton at saturation, (3) stiff polyester 
at equilibrium, (4) stiff polyester at saturation, and (5) soft polyester at saturation. 

Two of the treatments listed indicate the fabric was at equilibrium. The moisture content of the 
fabrics in this case had been allowed to come to equilibrium with the room moisture level prior to 
test. 

The evaporative water loss was measured by the instrument after 60 minutes of fabric appli- 
cation to the skin. The data for all subjects on each of the treatments are shown in the table. 


Treatment 
Subject ] 2 3 4 | 
l 4.04 6.50 4.01 10.71 10.66 
2 220. eee 1.94 8.39 7.42 
3 Ee eee ee 1.58 8.63 13.86 
4 S00 4.15 4.09 515 
5 1.94 9.59 12.14 6.30 12 79 


Source: Dr. K. Hatch, Family and Consumer Resources, University 
of Arizona. 
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a. Describe the design. 

b. Write the analysis of variance mixed model for this study, identify the terms, and state the as- 
sumptions necessary for an analysis of the data. 

c. The textile scientist was interested in four particular contrasts among the treatments. The con- 
trasts of interest were (i) 1 vs. 2, (ii) 2 vs. 4, (iii) 2 vs. 5, and (iv) 3 vs. 4. Write out the table of 
contrasts. Are the contrasts orthogonal? Explain. 

d. Conduct the mixed-model analysis of variance for the data. Include an analysis of the contrasts 
listed in part (c). Interpret the results. 

e. Obtain the residual plots for the analysis, and interpret them. 

f. Conduct an analysis for each of the contrasts listed in part (c). Write the linear model for one 
of the contrasts, identify the terms, and indicate the necessary assumptions for the model. 
Compute the analyses for the contrasts, and interpret the results. Are the experimental error 
variances comparable for the contrasts? Do the conclusions differ from those in part (d)? 
Explain. 


4. An agronomist conducted a yield trial with five alfalfa cultivars in a randomized complete block 
design with three replications. Each plot was harvested four times in each of two years. The plot 
yields (Ib/plot) from two harvests from each plot in each of two years are shown in the table. 


1986 1987 

Cultivar Block April May April May 
l ] 204 a 14.8 22.9 
2 pa Re Ware ae A 18.8 22.6 

3 2d a@a4 14.3. 22.1 

2 l 19.1 , 22.4 14.5 19.2 
2 20.32 22H 10345:;22,0 

3 DOS: ad 12.0: 2245 

3 l 193.0224 14.5 19.5 
2 19.8 -25.4 16.0523 

3 20.5 24.8 L6.7* “20:1 

4 ieee 2 0 14.9 19.5 
21.3" "244 16.0 18.1 

2 268 208 ht eee 

5 | 21.4 24.5 14.1 21.3 
Z 20.1 22,9 I26- 20.0 

3 i 218 14.3. 21.4 


Scene an SN Es SSN SEAN SN OLDE A SONS a SE 
Source: Dr. M. Ottman, Department of Plant Sciences, University of 
Arizona. 


a. Describe the study in terms of the between-subjects and within-subjects designs. 

b. Compute the mean of the observations for each cultivar at each harvest of the two years, and 
make a profile plot of the results for each cultivar. 

c. Write the linear model for a split-plot analysis variance, identify the terms, and indicate the as- 
sumptions necessary for an analysis of the data. 


f. 
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If a repeated measures analysis computer program is available, obtain the 4 matrix of experi- 
mental errors for within-subject variances and covariances from the four measurements over 
months and years. Test the hypothesis that the Huynh—Feldt condition can be assumed for the 
> matrix. What is your conclusion? | 

The agronomist wants to compare the yields of the cultivars. The significance of interaction 
between cultivars, years, and months of harvest must be determined before the agronomist can 
compare the cultivar averages over months and years. The Huynh—Feldt assumption for the & 
matrix of experimental errors should be tested for the analysis of months x cultivars and for 
the analysis of years x months x cultivars if your program has the capability. Conduct the 
tests, compute the split-plot analysis for the data, and test the hypotheses for cultivar interac- 
tions with months and years. Use the G—G or H-F epsilon adjustments for the F tests if neces- 
sary. What are your conclusions from this analysis? 

Obtain the residual plots from the split-plot analysis, and interpret them. 


An agronomist conducted an experiment to evaluate the drought tolerance of four barley cultivars. 
He used a line source sprinkler system to create a water gradient treatment design on each cultivar 
plot. A description of the design was presented in Section 15.6. The four cultivars were randomly 
assigned to the plots in a randomized complete block design. The amount of water applied to each 
of the plots decreased with distance from the sprinkler line. The grain yield was measured on 12 sq 
ft subplots on each cultivar plot at four equally spaced distances from the line sprinkler. The data 
(grams of barley grain per 12 sq ft) for the subplots are shown in the table. 


Distance from Sprinkler* 


Variety Block ] 2 3 4 
l ] 416. /: <3si@tes 280 W984 
2 490.2 513.7 438.4 348.1 
= 341.2 452.0 541.5 458.8 
2 l 644.7 555.4 587.8 413.7 
$. 526.8 481.4 490.3 468.1 
3 540.6 S04 3° 495.9° 5235-5 
3 l 368.9 “ast 8 3550 222.2 
2 298.5 40/5 500.0 . 320.3 
3 386.7 388.4 492.4 438.2 
4 ] 512.0 5989 442.1 1386.0 
2 484.8 542.5 463.1 383.2 


3 368.5 547.8 702.9 445.3 


"1 = closest to sprinkler, 4 = greatest distance from sprinkler 
Source: Dr. M. Ottman, Department of Plant Sciences, University 
of Arizona. 


Describe the study in terms of the between-subjects and within-subjects designs. 

Compute the mean of the observations for each cultivar at each distance, and make a profile 
plot of the results. 

Write the linear model for a split-plot analysis of variance of the data, identify the terms, and 
indicate the assumptions necessary for an analysis of the data. 
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d. 


Conduct the split-plot analysis for the data, test the necessary hypotheses, and compute treat- 
ment means and their standard errors. What are your conclusions? 

Compute the sum of squares partitions for the linear, quadratic, and cubic contrasts on distance 
from the sprinkler; test the null hypotheses and interpret the results. 

If a repeated measures analysis computer program is available test the hypothesis that the 
Huynh-—Feldt condition can be assumed for the 4: matrix of the experimental errors for within- 
subject variances and covariances. What is your conclusion? 

Apply the Greenhouse—Geisser adjustment to the F tests in part (d). Do the conclusions differ? 
Explain. 

Conduct a separate analysis of the linear, quadratic, and cubic regression contrasts. Write the 
linear model for one of the contrasts, identify the terms, and indicate the necessary assump- 
tions for the model. Compute the analyses for the contrasts, and interpret the results. Are the 
experimental error variances comparable for the contrasts? Do the results differ from those in 
part (e)? Explain. 


15A.1 Appendix: The Mauchly Test for Sphericity 


The Huynh—Feldt condition for the matrix of variances and covariances of the p re- 
peated measures of subjects requires (p — 1) normalized orthogonal contrasts for 
the repeated measures to be uncorrelated with equal variances. Let 2 be the 
covariance matrix of the repeated measures. Let the matrix C be a (p— 1) x p 
matrix, where the rows are normalized orthogonal contrasts on the p repeated 
measures. The required Huynh—Feldt condition for the covariance of the contrasts 
is C 4 C’ = XI, where I is the identity matrix and C’ is the transpose of C. If 
the condition is satisfied, the covariance matrix AJ is said to be spherical. 

Let s;; be the element in the zth row and jth column of the pooled p x p co- 
variance matrix for the within-subject experimental errors S, with v degrees of 
freedom. Choose (p—1) normalized orthogonal contrasts on the p repeated 
measures. Let the matrix C’ be the (p—1) x p matrix, where the rows are 
normalized orthogonal contrasts on the p repeated measures. Compute the 
(p — 1) x (p— 1) matrix CSC". The test statistic (Mauchly, 1940) for the null 
hypothesis Hp: C & C’ = AT is 


Veg 1)" | CROr 


(tr OSC’! eal 


where trC’SC’ is the trace of the matrix. The trace of a matrix is the sum of its 
diagonal elements. The test statistic is scaled to improve the accuracy of its 
approximation by the chi-square distribution. The scale factor for the chi-square 
approximation with f = ; p(p — 1) — | degrees of freedom is 


a 2p? —3p+3 
6(p — 1) 


The null hypothesis is rejected at the a level of significance if — yInW > Lon fr 


ye (15A.2) 
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15A.2 Appendix: Degrees of Freedom Adjustments for 
Repeated Measures Analysis of Variance 


The Greenhouse—Geisser € (Greenhouse & Geisser, 1959), and the Huynh—Feldt € 
(Huynh & Feldt, 1976) were proposed as degrees of freedom adjustments for F’ 
tests in the analysis of within-subjects treatment factors. The adjustments, based on 
work by Box (1954a, 1954b), were developed for designs with only one within- 
subject treatment factor. The Huynh—Feldt € is a simple function of the 
Greenhouse—Geisser €. The computations are outlined in the following paragraphs 
beginning with computations for €. 

Let s;; be the element in the zth row and jth column of the pooled p x p co- 
variance matrix for the within-subject experimental errors, S. Choose gq = (p — 1) 
normalized orthogonal contrasts on the p repeated measures. Let the matrix C' be a 
q X p matrix, where the rows are normalized orthogonal contrasts on the p repeated 
measures. Compute the q xq matrix A=CSC” with elements {a;;}. The 
Greenhouse—Geisser € adjustment is computed as 


BS ees (15A.3) 
q y ai, 
7=1 j=1 
The Huynh—Feldt € adjustment is computed as 
Nqé —2 
_2 (15A.4) 
qv — ge) 


where JV is the number of subjects and v is the error degrees of freedom for the ex- 
perimental error from the between-subjects analysis of variance. If the experiment 
consists of NV subjects each with p treatments, then vy = N — 1. With t between- 
subjects treatments in a completely randomized design, then vy = N —t. With r 
replications of t between-subjects treatments in a randomized complete blocks de- 
sign, then v = (t — 1)(r — 1). 

Given a within-subjects Fo statistic based on mean squares with 1, and 12 
degrees of freedom the adjusted degrees of freedom for the test are ev, and él, 
where € is used for the Greenhouse—Geisser adjustment and € is used for the 
Huynh-—Feldt adjustment. The Huynh—Feldt & is not used if ‘€ > 1. 


16 Crossover Designs 


The principal topic of this chapter is the design and analysis of experiments with dif- 
ferent treatments administered in successive periods of time to the experimental 
units. The analysis of the experiments includes an evaluation of the effect of a treat- 
ment that may carry over to affect the response of a treatment in the following 
period. Designs specifically constructed to efficiently estimate the direct and carry- 
over effects of treatments are presented in this chapter. 


16.1 Administer All Treatments to Each Experimental Unit 


520 


The crossover study describes experiments with treatments administered in se- 
quence to each experimental unit. A treatment is administered to an experimental 
unit for a specific period of time after which another treatment is administered to 
the same unit. The treatments are successively administered to the unit until it has 
received all treatments. 


Crossover Designs Can Increase Precision and Reduce Costs 


When treatments are compared on the same experimental unit, the between-unit 
variation is removed from the experimental error. Thus, the individual experimental 
units are used as blocks in the crossover design to decrease the experimental error 
and increase efficiency of the experiment. The treatment comparisons in blocked 
designs are generally more precise than those in unblocked designs because the ex- 
perimental unit or block variation is removed from comparisons between treatments 
on the same experimental unit. Thus, the primary advantage of crossover studies is 
the increased precision of treatment comparisons. 
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Crossover designs provide an economy of resources when a limited number of 
units are available for the study. Most commonly, crossover designs are used with 
human and animal subjects. The expense of maintaining large animals and the diffi- 
culties in recruiting adequate numbers of human subjects to achieve sufficient repli- 
cation make crossover designs more attractive since they require fewer units for an 
equal amount of treatment replication. The following example of an experiment 
used a crossover design with beef steers in a feeding trial. 


Example 16.1 Digestibility of Feedstuffs in Beef Cattle 


Associative effects occur in animal diets when feedstuffs are combined and 
diet utilization or animal performance is different from that predicted from a 
sum of the individual ingredients. The addition of roughage to the diets of 
ruminant animals had been shown to influence various diet utilization factors 
such as ruminal retention time. However, information about the relative asso- 
ciative effects of different roughage was scarce, especially in mixed feedlot 
diets. 


Research Hypothesis: An animal scientist hypothesized roughage source 
could influence utilization of mixed diets of beef steers by altering ruminal 
digestion of other diet ingredients. 


Treatment Design: The basic mixed diet for the beef steers was a 65% con- 
centrate based on steam-flaked milo and 35% roughage. Three roughage 
treatments were used with (A) 35% alfalfa hay as a control treatment, (B) 
17.5% wheat straw and 17.5% alfalfa, and (C) 17.5% cottonseed hulls and 
17.5% alfalfa. 


Experiment Design: Twelve beef steers were available for the study. Each of 
the three roughage diets was fed to the steers in one of six possible sequences. 
Each diet in each sequence was fed to two steers for 30 days. The steers were 
allowed a period of 21 days to adapt to a diet change before any data were 
collected. 

The Neutral Detergent Fiber (NDF) digestion coefficient calculated for 
each steer on each diet is shown in Table 16.1. The NDF digestion coefficient 
indicates the percent of dietary fiber digested by the steer. 


Table 16.1 NDF digestion coefficients for two steers in each sequence of three 
roughage diets in a crossover design 


Sequence 


1 2 =] “ 5 6 


Steer: Pee BSEhy 5 6 pe eR 6 10 a gee 
Peel (4) 30 8o th) ae ot oe ee Ue) 50 «55 (C41 46 
Pescd ll (B) 6 60 tC) 4 AS A 55 56 TC) 4B 51 (A) 57 59 - B) 56. 58 
Peoeat 0) CA) 37° SD Ba 50 OB) ST, 34 fy 51 55° (A) 58- 61 


Source: J. Moore, Department of Animal Science, University of Arizona. 
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Other Examples: A new drug and a standard drug are tested in a crossover design 
using patients afflicted with acute bronchial asthma to determine whether the new 
drug improves the patients’ breathing over the standard drug. The new drug is 
administered to a patient during the first week, and the standard drug is 
administered to the same patient during the second week. 

Three front panel designs developed to operate a laboratory instrument are 
tested in a crossover design with laboratory technicians as operators. The technician 
operates the instrument with each of the panel designs. Each technician tests the 
three panel designs on successive days, a different panel each day. 


Design to Avoid Confounding Time Period Effects with Treatments 


A comparison between two treatments on the same experimental unit is also a com- 
parison between two time periods. Treatments and periods both can contribute to 
any observed differences. Crossover trials are designed to avoid confounding of 
period and treatment effects. For example, one group of experimental units will re- 
ceive the sequence A-B, and a second group of units will receive the sequence 
B-A. Both treatments are administered in each period, and the treatment com- 
parisons are independent of comparisons between periods. 


A Carryover Effect Can Persist After the Treatment Period 


A disadvantage of the crossover design is the possibility that a treatment given in 
one period will influence the response in the following treatment period. Effects of 
a treatment that continue into the next treatment period are carryover effects. 

Typically, the treatments are administered in crossover designs for a length of 
time sufficient to allow the effect of the treatment to be manifested on the subjects. 
The subject is removed from the treatment for a resting, or washout, period be- 
tween two treatment periods to bring the subject back to its original physiological 
or psychological state. 

For example, after a drug is administered to a patient the rest period is intended 
to allow any residual of the drug to “wash out” of the patient’s system so that it will 
not be present in the succeeding treatment period. Although sufficient resting time 
is allowed for the initial drug to disappear from the system, the physiological state 
may have been altered sufficiently to have some effect on the responses in the suc- 
ceeding treatment period. The potential for a carryover effect cannot be ignored in 
crossover studies. 

Since carryover effects are assumed to be present in studies, designs con- 
structed specifically to measure carryover effects will be discussed in this chapter. 
A variety of designs have been developed for crossover studies to meet specific 
needs of different research problems. The discussions in this chapter are restricted 
to a few basic crossover designs to illustrate basic principles for design and analysis 
of crossover studies to meet specific goals of the research study. 

The basic crossover design in Example 16.1 will be used to illustrate the rela- 
tionship between carryover effects of treatments and other effects present in the 
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experiment as well as to provide the foundation for the basic statistical model and 
analysis of crossover studies. 


A Balanced Row—Column Design for the Digestibility Study 


The crossover design for the roughage diets is a balanced row—column design. The 
periods and steers are the rows and columns of the design. Each of the roughage di- 
ets occurs one time in each steer and four times in each period of the design. The 
design was referred to as a Latin rectangle design in Chapter 8. 

The six sequences of diet treatments are shown in Display 16.1. The six se- 
quences of roughage diets were randomly assigned to the 12 steers. However, the 
order of diet administration to each of the steers was not randomized. Each of the 
six sequences of the three treatments must be present with equal frequency to avoid 
confounding the period effects with the treatment effects and to have a design bal- 
anced for carryover effects. 


Display 16.1 A Crossover Design with Six Treatment Sequences 
for Three Treatments in Three Periods 


Sequence 


Period 
I 


I 


Designs to Balance the Carryover Effects 


A crossover design is balanced for carryover effects when each treatment follows 
each of the other treatments an equal number of times. Each treatment occurs 
equally frequently in each period, and it occurs once with each subject. 

The crossover design for the roughage diet study shown in Display 16.1 is a 
balanced design. Diet A follows diet B twice, once each in sequences 5 and 6; and 
it follows diet C twice, once each in sequences 2 and 3. Likewise, diets B and C fol- 
low each of the other two treatments two times. The balance applies only to first- 
order carryover effects that alter the response in the first period following admin- 
istration of the treatment. A second-order carryover effect alters the response in the 
second period following administration of the treatment. 

The 21-day adaptation period in the roughage diet study is equivalent to the 
washout, or rest period, used in other studies to avoid a carryover effect. In the ab- 
sence of carryover effects the effects of any previous diet are not manifested in the 
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digestion physiology of the animal in the current period, and the digestion measure- 
ments reflect only the direct effects of the current diet. 

The control diet treatment with 35% chopped alfalfa hay was the standard 
growing diet for feedlot steers. The treatments were designed to address the hy- 
pothesis that alternate roughage sources influenced utilization of mixed diets by 
altering rumen digestion. Diets B and C provided alternate roughage sources of 
wheat straw or cottonseed hulls mixed with the alfalfa. The remaining parts of the 
diet including grain and minerals were the same for all diets. If the research hy- 
pothesis is true the digestion of dietary fiber will differ among the three treatments. 

The three treatment means calculated from Table 16.1 data are, y, = 56.6, 
Yp = 53.3, and Yc = 47.2. The observed mean measures the direct effect of a treat- 
ment in the periods it was active on the subjects plus the carryover effects of the 
other two treatments. The direct effects of diet A are not confounded with periods 
or steers because they were measured in each of the periods and on each of the 
steers. In the absence of carryover effects 7, = 56.6 is an unbiased estimate of wa 
in the balanced design. Since diet A followed diet B in two sequences and diet C in 
two other sequences the response to diet A could be increased or decreased by any 
carryover effects of diets B or C administered to the steers in the previous periods. 


16.2 Analysis of Crossover Designs 


The Linear Model for Crossover Designs 


The crossover design has n treatment sequence groups and r; subjects in the zth 
group. There are ¢ treatments and each group of subjects receives treatments in a 
different order for p treatment periods. Let y;,, be the observation of the jth subject 
of the ith treatment sequence in the kth period. 

The linear model for a crossover design is 


Uisk = ph Ole Og ee Tia) FH Ace ey Ph Cask (16.1) 
(2 1,2.0:.,0 te So ee ee 


where yp is the general mean, a; is the effect of the 7th treatment sequence, b;; is the 
random effect with variance o? for the jth subject of the ith treatment sequence, 7; 
is the period effect, and e;,, is the random error with variance o? for the subject in 
period k. The direct effect of the treatment administered in period k of sequence 
group 2 IS Taiz), and A,(;, 4-1) is the carryover effect of the treatment administered 
in period k — 1 of sequence group 7. The value of the carryover effect for the ob- 
served response in the first period is A,(;9) = 0 since there is no carryover effect in 


the first period. 
For simplicity, the direct and carryover effects of the treatments are identified 
aS T1,72,...,7¢ and Aj, Ao,..., Az, respectively. The expected values for observa- 


tions in the first and second treatment sequences of the roughage diet study are with 
Yiz Tepresenting sequence 7 in period k: 
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Sequence | (A> B- C) BYyu)=ewratyt 
E(yi2) = w+ ay ae aes es 
E(yi3) = w+ ay + 73 + 73 + Ag 


Sequence 2 (B> C- A) E(y) = ptag+¢71+79 
Pig) = eg + 9 +75 + Ag. 
BGs) chia 3 +71 + As 


Carryover effects occur only in observations from the second and third period. 
E.g., Ac(1,0) = 0 in sequence | because no treatment precedes diet A. The observa- 
tions in periods 2 and 3 contain the carryover effects of diet A and diet B, A; and 
A2, respectively. Likewise, for sequence 2 the carryover effects of diets B and C, A» 
and 43, occur in the observations from the second and third periods, respectively. 


Assume the Univariate Model Assumption Is Satisfactory 


The observations on each experimental unit are repeated measures in time under 
different treatment conditions. They represent a multivariate observation on the ex- 
perimental unit. The assumptions required for the univariate analysis of variance 
were discussed in Chapter 15, “Repeated Measures Designs.” Kenward & Jones 
(1989) and Diggle et al. (1994) discuss details of repeated measures models for 
crossover designs beyond the scope of this book. 

Whether the univariate analysis of variance can be used for the crossover de- 
signs depends on the relationships among variances and covariances of the experi- 
mental errors for the repeated measures. The univariate analysis can be used if any 
of the assumptions for independence, compound symmetry, or the Huynh—Feldt 
condition is appropriate for the experimental errors. 

The Huynh—Feldt condition of equal variances for all possible differences be- 
tween repeated measures is the least restrictive assumption for the experimental er- 
rors. It will be assumed the Huynh—Feldt condition is satisfied for the data. 


The Analysis of Variance for Crossover Designs 
The Analysis Without Carryover Effects 


If the crossover design is a balanced row—column design the analysis of vari- 
ance described in Chapter 8 can be used in the absence of carryover effects, The 
subjects and time periods are the rows and columns of the design, and the direct 
treatment effects are orthogonal to the rows and columns. The sums of squares for 
rows, columns, treatments, and experimental error can be computed with the analy- 
sis of variance shown in Table 8.10. 


The Analysis with Carryover Effects 


The analysis of variance for the model with treatment carryover effects in 
Equation (16.1) is outlined in Table 16.2. The separation of the sums of squares 
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Table 16.2 Analysis of variance table for a crossover design with n sequences, p 
periods, ¢ treatments, and r; subjects in the 7th sequence; NV = i? 1; 


Source Degrees of Sum of Mean 
Variation Freedom Squares Square 
Total Np-1 S'S' Total 


Between subjects: 


Sequence n—1 SSS MSS 

Subjects within sequence (N —n) SSW MSW 
Within subjects: 

Period p— | Son MSP 

Treatments (direct) t—1 SST MST 

Treatments (carryover) t—1 SSC MSC 
Error (N — 1)\(p— 1) -— 20-1) SSE MSE 


partitions into between- and within-subjects groupings indicates the correspondence 
to a repeated measures split-plot univariate analysis. The subjects are the whole 
plots and the repeated measures over the p periods are the subplots. 


Treatment and Carryover Effects Are Nonorthogonal 


When carryover effects are present the direct treatment effects and carryover 
effects are not orthogonal, nor are the carryover effects orthogonal to the subject 
blocks in the balanced row—column design. The relationship between the effects 
can be observed in Display 16.2. The periods and sequences are each complete 
block designs for the direct treatment effects. The sequences form a balanced in- 
complete block design for the carryover effects. Each pair of carryover effects oc- 
cur together with two of the sequences. The direct and carryover treatment effects 
are not orthogonal because they do not occur in all possible combinations. The  di- 
rect and carryover effects of the same treatment never occur together in the same 
observation. 


Display 16.2 Model Effects, 7; and ,, for the Roughage Diet Study 


Sequence 
Period 


Ty + A3 T3 + ry 
T2 + Ay T2 + A3 
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The significance of carryover effects must be determined before inferences can 
be made about differences among the direct effects of treatments. Estimates of 
differences between treatment means, p,—p;, need to be adjusted for the 
carryover effects if they are present in the study. 


Adjusted Sums of Squares for Nonorthogonal Effects 


The sums of squares for direct and carryover treatment effects in the analysis 
of variance each must be adjusted for the other. 

The adjusted sums of squares are computed from the differences between sums 
of squares for experimental error from full and reduced models. The sum of squares 
for carryover effects adjusted for direct effects requires the error sums of squares 
from the models: 


Reduced model -yijg = tj + Dig + Yk + Ta t+ ize (16.2) 


Full model Yijk = Wat bij + ye + Ta + Ac + eije 


The sum of squares for carryover effects adjusted for direct effects is SSC = 
SSE) — SSE;z, where SSE) is the error sum of squares computed from the 
reduced model without A, and SSE; is computed from the full model. 

The sum of squares for direct effects adjusted for carryover effects requires the 
error sums of squares from the models: 


Reduced model yin = po + 04 + dij + Yk + Ac + Cigk (16.3) 


Full model Yaz = + 04 + dij; + Ye + Ac + Ta + eign 


The sum of squares for direct effects adjusted for carryover effects is SST = 
SSE 77) — SSEs, where SSE,,;) is the error sum of squares computed from the 
reduced model without 74 and S'S E; is computed from the full model. 


Data Coding for Computing Programs Requires Special Attention 


Many statistical computer programs compute the required sums of squares par- 
titions for balanced or unbalanced designs. Special attention must be given to cod- 
ing the data file to include the carryover effects in the model. Alternative coding 
strategies for the carryover effects can be found in Ratkowsky, Alldredge, and 
Cotton (1990) and Milliken and Johnson (1984). The detailed coding for the data 
file of Example 16.1 is shown in Appendix 16A.1. Formulae for manual calcula- 
tions are available for selected balanced designs in Cochran and Cox (1957), Peter- 
sen (1985), and Gill (1978). A brief outline of critical formulae for adjusted 
treatment sums of squares for balanced designs appears in Appendix 16A.2. 
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The Analysis of Variance for Roughage Diets 


The analysis of variance for the roughage diet study is shown in Table 16.3. 
The two analyses illustrate the alternative fitting of full and reduced models to ob- 
tain the adjusted sums of squares for carryover and direct treatment effects. 


Table 16.3 Analysis of variance for NDF digestion from the roughage diet study 


in a crossover design 


Source of Degrees of Sum of Mean 
Variation Freedom Squares Square F Pro 
Sequence 5 331.67 66.33 3.48 .080 
Steers in sequence 6 114.33 19.06 
Period 2 288.17 144.08 16.01 .000 
Diet p 359.50 219 te S109 .000 
Carryover ) 18.37 9.197 1.02 380 
Error 18 161.96 9.00 

“Mean Square for direct treatment effects unadjusted for carryover effects, MST(unadj.) 

‘Mean Square for carryover treatment effects adjusted for direct effects, MSC(adj.) 
Source of Degrees of Sum of Mean 

Variation Freedom — Squares Square Ff af ae ae al 

Sequence 5 S51,01 66.33 3.48 .080 
Steers in sequence 6 114.33 19.06 
Period B. 288.17 144.08 16.01 .000 
Carryover z 129.60 64.80* 7.20 005 
Diet 2 448.28 224.14? 24.91 .000 
Error 18 161.96 9.00 


“Mean Square for carryover treatment effects unadjusted for direct effects, MSC(unadj.) 
*Mean Square for direct treatment effects adjusted for carryover effects, MST(adj.) 


The first analysis in Table 16.3 provides the mean square for carryover effects 
adjusted for direct effects, M/S(Carryover, adj.) = 9.19, andthe second analysis 
provides the mean square for direct treatment effects adjusted for carryover effects, 
M S(Diet, adj.) = 224.14. The differences between the adjusted and unadjusted 
mean squares reflect the nonorthogonal relationship between the carryover effects 
and direct effects of treatments. The unadjusted mean square for direct treatment 
effects in the first analysis is M/S(Diet, unadj.) = 279.75, and the unadjusted mean 
square for carryover effects in the second analysis is MS(Carryover, unadj.) 

= 64.80. 


16.2 ANALYSIS OF CROSSOVER DESIGNS 529 


Tests of Hypotheses for Direct and Carryover Effects of Treatments 


The null hypothesis for carryover effects is Ho: A; = Ax = A3 = O. It is the hy- 
pothesis of initial interest for the study because inferences about the direct treat- 
ment effects are dependent on the presence of carryover effects. The test statistic 
for the hypothesis of no carryover effects is 


M S(Carryover, adj.) #919 


MSE "sop 


= 
with 2 and 18 degrees of freedom. The level of significance for the test is 
Pr > F' = .380, and the null hypothesis is not rejected. The carryover effects are 
not significant in the roughage diet study. 

The statistic Fo = MS(Diet, adj.)/MSE = 224.14/9.00 = 24.90 with 2 and 
18 degrees of freedom tests the null hypothesis for direct treatment effects. The null 
hypothesis is rejected with significance level Pr > F = .000. There are significant 
differences in NDF digestion among the three roughage diets. 

The significance of carryover effects unadjusted for direct treatment effects in 
the second analysis of variance in Table 16.3 again reflects the nonorthogonal rela- 
tionship between direct and carryover effects. The correlation between their esti- 
mates leads to difficulties in distinguishing between their separate contributions. 
The sum of squares for carryover effects unadjusted for direct effects, 


SS(Carryover, unadj.) = 129.60 


_-—~is.considerably larger than the adjusted sum of squares, 
a < S'S(Carryover, adj.) = 18.37 


because the direct effects are intermingled with the carryover effects in the unad- 

justed sum of squares. Consequently, the significance of the unadjusted mean 

square is due to a combination of variation in direct and carryover effects, and it is 
~——not possible to determine which set of effects is contributing to the significance. 

__ Ifthe carryover effects are not significant it is common practice to base all in- 
ferences on the treatment means unadjusted for carryover effects. The analysis for 
Latin square designs in Chapter 8 would be used in this case. However, a detailed 
study by Abeyasekera and Curnow (1984) on the consequences of ignoring any ad- 
justments for carryover effects led them to recommend always adjusting the treat- 
ment means for carryover effects regardless of the significance of the test for 
carryover effects. 


Interpretation of Treatments with Multiple Contrasts 


According to the research hypothesis the rumen digestion efficiency is dependent 
on the source of roughage. The standard or control diet A included 35% chopped 
alfalfa hay. The altered diets had half that amount of alfalfa and 17.5% wheat straw 
(diet B) or 17.5% cottonseed hulls (diet C). The null hypothesis of no change in 
NDF digestion by the altered diets can be tested with comparisons of the control 
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diet with the altered diets using the Dunnett method. The estimates of the contrasts 
between the adjusted means of altered diets and diet A, i, — fig and fi, — fic, are 
shown in Table 16.4, along with the 95% simultaneous confidence intervals for 
comparisons of control with other treatments using the Dunnett method. 


Table 16.4 Estimates of contrasts between the control and altered diets for the 
roughage diet study in a crossover design 


Standard 95% SCI 
Contrast Estimate Error (h,0) 
Diet A—Diet B 4.06 1.37 (0.77, 7.35) 
Diet A—Diet C 9.63 1.27 | (6.34, 12.92) 


The estimated standard error of the difference between two adjusted means is 
1.37 in the third column of Table 16.4. The two-sided Dunnett statistic for two 
comparisons is do5.2,13 = 2.40. The 95% SCI indicate that the replacement of half 
the alfalfa hay by cottonseed hulls (diet C) in the diet reduced the NDF digestion to 
a greater extent than replacement by wheat straw (diet B). since the lower limit of 
the interval for diet C is further removed from 0 than is that for diet B. 

The critical value for a confident inequalities test with the Dunnett method is 
| i, —B; | > D(2,.05) = 1.37(2.40) = 3.28. The difference for diet A versus diet 
B, 4.06, and the difference for diet A \versus. diet C, 9.63, are both significant 
because they exceed the critical value. The replacement of half the alfalfa hay by 
cottonseed hulls (diet C) in the diet reduces \ gestion by an estimated 
9.63%, and the replacement by wheat straw (diet B) reduces the NDF digestion by 

an estimated 4.06%. 


16.3. Balanced Designs for Crossover Studies 


Practical considerations dictate whether a crossover study is appropriate for the re- 
search problem. The designs are most effective when treatments manifest an effect 
on the subject in a reasonably short period of time to provide a study time of man- 
ageable length. Extensive discussions of crossover studies can be found in Jones 
and Kenward (1989) and Ratkowsky, Evans, and Alldredge (1993). 


When Crossover Designs Can Be Successful 


Studies that measure responses to changes in animal diets have been successfully 
completed with crossover designs. Crossover designs reduce the costs of main- 
taining the larger numbers of animals required to provide equivalent treatment 
replication with other designs. The cost savings are especially pertinent in large ani- 
mal studies. Human factors studies can use crossover designs to remove subject- to- 
subject variability in stress or anxiety responses to treatments. Differential 
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physiological or psychological stresses are measured readily on each subject in ex- 
periments designed to compare different systems of component assembly or instru- 
ment operation with each subject. 


When Crossover Designs May Not Be Successful 


The designs should not be used in medical clinical trials for acute conditions such 
as postoperative pain. If a treatment in the sequence cures the acute condition noth- 
ing remains to treat in the succeeding periods. The designs can be used effectively 
to study the treatment of persistent conditions such as arthritis when treatments 
alleviate the symptoms rather than cure when administered to the individual. 


Practical Considerations Influence Design Choice 


Other practical considerations influence the choice of a design if the crossover 
study is feasible. The likelihood of subject loss during the course of the study in- 
creases with the number of periods. The more desirable designs, those balanced for 
carryover effects, increase in complexity as more treatments are added to the study. 
Consequently, the correct order of treatment administration to subjects becomes 
more difficult to manage in large studies. The ability to interpret results can dimin- 
ish withthe complexity of the design. 
’~ Many different designs have been developed for crossover studies to meet spe- 
‘cific requirements for the practical research problem or to have good statistical 
"properties. Three general categories of designs introduced in this section are useful 
\ for a variety of research studies that can appropriately use crossover studies. They 
‘are restricted to balanced designs for three or more treatments and three or more 
periods. Two-period designs are not efficient for estimation of treatment means. 
Designs for two treatments require special orchestration of treatment sequences to 
estimate all of the effects of interest in the study. Designs for two treatments are 
discussed in a separate section. 
The categories of designs discussed in the remainder of this section are based 
on the relationship between the number of treatments ¢ and the number of periods 
p. They are designs with p = t,p = t+ l, and p < t. 


Compare Efficiency of Crossover Designs with Latin Squares 
The efficiency of a design for statistical estimation of direct and carryover effects is 
an important consideration in the choice of design. The variance of the difference 
between two treatment effects, 07, forms the basis for efficiency measures. The 
Latin square design, in the absence of carryover effects, provides the minimum 
variance, 20/r, and is the standard for comparison. The ratio of the variance for 
the Latin square to the crossover design measures the relative efficiency of the 
crossover design as 

207 /r 
Re = nes 100 (16.4) 

d 
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The relative efficiency measure is applied to either direct or crossover effects. The 
variance of the difference, a, pertains to either treatment effect adjusted for the 
other. 

The most desirable designs have variance balance where the variance of the 
difference between two treatment effects, direct or carryover, is the same for all 
treatment pairs. The balance defined by the relationship between direct and carry- 
over effects is the other desirable feature of a design. The design is balanced if the 
direct effect of each treatment is associated equally frequently with the first-order 
carryover effect of each other treatment. Patterson and Lucas (1962) presented 
some general formulae to compute efficiencies for balanced designs. 


Designs for Equal Numbers of Periods and Treatments, p = f 


The Williams Designs Constructed from Latin Squares 


Williams (1949) presented methods to construct balanced crossover designs 
from Latin square arrangements. If the number of treatments is even, t = 4,6,..., 
a balanced design can be constructed from one particular Latin square. Two 
particular squares are required if the number sss treatments is an odd number, 
bod, 5753 . 

Two 3 x 3 Latin squares that produced six treatment sequences were used for 
the design of the roughage diet study of Example 16.1. The design i is reproduced in 
Display 16.3 showing the two 3 x 3 Latin squares. One Latin square was used for 
the first three sequences of treatments in the = aymuls the second square 
was used for the last three sequences. ~» ? 


Display 16.3 A Balanced Crossover Design 
from Two 3 x 3 Latin Squares 
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Williams designs for t = 4, 5, and 6 treatments and balanced for carryover ef- 
fects are shown in Display 16.4. Each treatment follows each of the other treat- 
ments one time in the designs with one Latin square for aneven number of 
treatments, and the direct effects of each treatment associate once with the first- 
order carryover effects of all other treatments. For the designs with an odd number 
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of treatments each treatment follows the others twice, and the direct effects of each 
treatment associate twice with the first-order carryover effect of all other treat- 
ments. The relative efficiency of the Williams designs for direct effects Ez and the 
relative efficiency FE, for carryover effects are shown below each of the designs in 
Displays 16.3 and 16.4. 


Display 16.4 Balanced Latin Square Crossover Designs 
for Four, Five, or Six Treatments 


Four Treatments Six Treatments 


Sequence Group Sequence Group 
Period 3 Period 
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D 
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Five Treatments 

Sequence Group | Sequence Group 2 
Period 1 10 
is 101 A 
II B 
Il D 
IV B 
V c 
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Designs from Orthogonal Sets of Latin Squares 


A complete set of orthogonal Latin squares provides a crossover design bal- 
anced for all orders of carryover effects. An orthogonal set of Latin squares for t 
treatments requires t — 1 squares. Two Latin squares are orthogonal when the two 
squares are superimposed and each treatment of one square occurs once with each 
treatment of the other square. 

Two orthogonal 3 x 3 Latin squares were used for the roughage diet study. 
They make up the Williams design shown in Display 16.3. 

Note when square | is imposed over square 2 that the treatment label A in 
square | will occur one time each with treatment labels A, B, and C in square 2. 
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The same occurrence pattern is true of the labels B and C from square 1 when 
superimposed on square 2. 

The complete set of orthogonal squares requires t(¢ — 1) subjects for any num- 
ber of treatments. For example, the orthogonal set of three Latin squares for four 
treatments requires 12 subject groups. 

The efficiency of the design for estimating direct and carryover effects from 
the complete set of orthogonal squares is the same as that for the Williams designs. 
The Williams designs require only t — 1 subjects for t even and 2(¢ — 1) subjects 
for t odd, thus the only real advantage of a complete orthogonal set of Latin square 
designs over the Williams designs is that the complete set is balanced for all orders 
of carryover effects up to order (p — 1). For example, the design with four treat- 
ments is balanced for first-, second-, and third-order carryover effects. 

When only first-order carryover effects are present in the study the Williams 
designs have a clear savings in number of treatment sequences to maintain with 
more than three treatments. 


Extra-Period Designs for Orthogonal Direct and Carryover Effects, p = t+ 1 


The direct and carryover effects are not orthogonal in designs where the number of 
periods equals the number of treatments. Lucas (195) )) pointed out that the non- 
orthogonality of the balanced designs resulted in mt ich lower efficiency for esti- 
mating carryover effects than for estimating direct, effects of treatments. The 
differences in efficiencies for estimating the two e he can be seen in Display 
16.4. The addition of an extra period to the balanced design can remove the non- 
orthogonality and provide designs with independent estimates of direct and carry- 
over effects that have more equal precision for estimates of direct and carryover 
effects. Lucas (1957) provided the first formal description of extra-period crossover 
designs. 

The simplest extra-period design is derived from a balanced crossover design 
by repeating in period (p + 1) the treatment administered to the subject in period p. 
An extra-period design for t = 4 treatments in p=5 periods derived from the 
Williams design for four treatments is shown in Display 16.5. 

Each treatment administered to a subject in period IV is repeated on the same 
subject in period V. Each treatment is preceded by each of the other treatments in 
the design including itself. For example, treatment A is preceded by itself in se- 
quence 3, by B in sequence 2, by C in sequence 4, and by D in sequence 3. 

The extra-period Williams design is completely balanced for carryover effects 
since each carryover effect occurs an equal number of times in each sequence and 
with each direct effect. Thus, the estimates of the direct effects are the same wheth- 
er or not the carryover effects are in the model, and vice versa. The addition of an 
extra period makes the design unbalanced with respect to sequences and direct ef- 
fects because treatments appear an unequal number of times within each sequence. 
Note in Display 16.5 that one of the treatments appears two times in a sequence 
while each of the other treatments appears only once in the same sequence. 

The efficiencies for orthogonal sets of Latin squares with one extra period are 
the same as those for the Williams designs with one extra period. The advantage of 
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Display 16.5 Extra-Period Crossover Design 
for Four Treatments 
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increasing the precision for estimating carryover effects must be weighed against 
the disadvantage of extending the length and cost of the experiment with the extra 
period. The designs should only be used if there is a strong indication that the 
carryover effects are likely to occur in the study. Some bias in the responses can oc- 
cur if either the subjects or the investigators are aware that the last two measure- 
ment periods involve the same treatment. 


Designs for Less than a Full Treatment Cycle, p < ¢ 


Each subject received all of the treatments in the balanced designs discussed in the 
previous sections. Practical or ethical considerations can prevent adminstration of 
the full cycle of treatments to each of the subjects. It may be more practical to ad- 
minister only part of the treatments to each subject to reduce the chances of sub- 
jects dropping out of the study. Ethical considerations may prevent the investigator 
from administering too many treatments to patients in a clinical trial. 

Patterson (1951, 1952) described methods to construct balanced designs for 
studies that required the number of periods to be less than the number of treat- 
ments, p < t. The subjects are incomplete blocks in the design because they receive 
less than the full complement of treatments. The incomplete designs may be de- 
rived by deleting one or more periods from a complete set of orthogonal Latin 
squares. The incomplete designs also may be obtained from the incomplete Latin 
squares or Youden squares discussed in Chapter 9 for row—column designs. Finally, 
they can be constructed from regular balanced incomplete block designs. For 
illustration, a design for seven treatments and four periods is shown in Display 
16.6. 
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Display 16.6 Incomplete Design for Seven 
Treatments and Four Periods 
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Tables of incomplete Latin square designs for crossover studies can be found 
in Petersen (1985) and Patterson and Lucas (1962). Jones and Kenward (1989) 
listed the most efficient incomplete designs for t = 3,4, 5,6, and 7 treatments. 


16.4 Crossover Designs for Two Treatments 


Perhaps the most widely used design for crossover studies has been the two- 
treatment, two-sequence, two-period design. One group receives the treatment se- 
quence A-B and another group receives the treatment sequence BA, as shown in 
Display 16.7. Each of the treatments appears in/each sequence and in each period of 
the 2 x 2 Latin square arrangement. Thus, the treatment effects are not confounded 
with the effects of sequences or periods, the row and column equivalents of the 


Latin square design. 


Display 16.7 The2 x 2 Crossover Design 
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The Linear Model for Two Treatments 
The linear model for the design with two treatments is \ 


Yijk = + a4 + di; 49H Td HNe Heeigs, | (16.5) 
| | 
Re SS oe Os ee eae, 
sn es 
where p is the general mean, a; is the cee the random subject ef- 
fect with mean 0 and variance oe 7, is the period effect, 7, is the direct treatment 
effect, A. is the carryover effect, and e;,;, is the independent, random error with 


mean 0 and variance o”. 


Carryover Effects Confounded in the 2 x 2 Design 


The 2 x 2 crossover design has one major deficiency. The carryover effects are 
confounded with other effects in the study, and the significance of the carryover ef- 
fects cannot be tested from an analysis of the model in Equation (16.5). The 
observed means for the subjects in each period of each sequence of the design, 7,,, 
V2 Yo15 Yo2, Contain all of the information available in the study to estimate the 
design factor effects for sequences and periods and the direct and carryover 
treatment... effects, ..The .expected...values...of the cell means, 
E(x) = vik = U+0; + 7% +74 + Ac, are Shown in Display 16.8. 


Display 16.8 Expected Cell Means for the 2 x 2 Crossover Design 
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Simplified expectations follow upon application of the relationship 7; + tT. = 0 
based on our definition of treatment effects. The sum of the two treatment effects is 


0 so that T2 = — 7, and — 7, can be substituted for 7) in the expectations. Similar 
substitutions can be made for the other factor effects. Given the substitutions 
Q2= — a1, 72 = —%J%, and Ay = — Aj, the expectations are 

BYGy) =H =eptat+y+71 
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The five parameters to be estimated are ju, aj, 71, 71, and A;, and only four ob- 
served means are available for estimation of the parameters. It is not possible to ob- 
tain estimates of parameters for some of the effects without involving the 
parameters of other effects. In particular, it is not possible to disentangle the se- 
quence effect a, or the direct treatment effect 7, from the carryover effect A; in the 
analysis. If one of the parameters can be eliminated from the model, the remaining 
parameters can be estimated from the four cell means. 


Eliminate Some Model Parameters to Estimate Effects 


A number of alternative models and methods of analysis have been proposed for 
the 2 x 2 crossover design (Grizzle, 1965, 1974; Hills & Armitage, 1979; Milliken 
& Johnson, 1984). Regardless of the model one parameter has to be eliminated to 
estimate the remaining parameters and test hypotheses about them. Under certain 
circumstances it may be possible to provide arguments for the elimination of se- 
quence or period effects from the model. Eliminating either of those parameters 
puts the analysis at risk because inferences are made under the generous assump- 
tion of either no period or no sequence effect. Effects are left in the model because 
it is necessary to test their significance in the study and not because it is believed 
they are actually present in the study. 

The test of significance for carryover effects from the 2 x 2 crossover design is 
possible only under the questionable assumption that some other effect is nonexis- 
tent. Under these circumstances the design should not be used. Jones and Kenward 
(1989) consider the design to be appropriate if the sequence, period, and carryover 
effects are negligible in size relative to the direct treatment effect. On the other 
hand, a panel for the U.S. Food and Drug Administration has recommended the de- 
sign not be used in drug evaluation. 


A Little Help from Baseline and Washout Observations 


Adjuncts to the 2 x 2 crossover design reviewed in detail by Jones and Kenward 
(1989) include the addition of baseline observations and washout period observa- 
tions. The baseline observations taken prior to any treatment administration enable 
the estimation of carryover effects and the test for their significance. However, the 
direct and carryover effects are still nonorthogonal in the design and the estimates 
of the direct and carryover effects are highly correlated, p = 0.87(07/n). Although 
the two effects are no longer confounded, the high correlation between their esti- 
mates leads to difficulties in distinguishing between their separate contributions. 
When estimates of effects are highly correlated, contradictory tests of significance 
can occur in the analysis. One effect may be nonsignificant when a second effect is 
in the model but significant when the second effect is removed from the model. 

The addition of observations on the washout period between the two-treatment 
period reduces the correlation between estimates of the two effects to 
p = 0.50(c?/n). Although the correlation is reduced, substantial nonorthogonality 
remains between the direct and carryover effects. The estimates are uncorrelated if 
observations are included from a second washout period after the second treatment 
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is terminated. Thus, a completely orthogonal design for two treatments is only pos- 
sible with added baseline and washout data that require a total of five measurement 
periods. 


Add an Extra Period to Estimate Carryover Effects 


The other adjuncts for the 2 x 2 crossover design reviewed by Jones and Kenward 
(1989) included designs for the two treatments administered in various combina- 
tions of sequences and periods. The addition of extra sequences or extra periods of 
treatment increases the ability to estimate the important effects in the crossover 
study. Two of the designs for two sequences in three periods will be discussed in 
this section. 

The switchback design for two treatments, shown in Display 16.9, has been 
used frequently as an adjunct to the 2 x 2 design to obtain estimates of the direct 
and carryover effects. However, the extra- period design for two treatments, also 
shown in Display 16.9, is a better design for the estimation of the treatment effects. 


Display 16.9 Switchback and Extra Period Designs 
for Two Treatments 
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The first two periods of both designs provide the basic 2 x 2 crossover design, 
but the designs differ in the treatment administered during the third period. The 
third period treatment reverts to the first period treatment in the switchback design, 
whereas the second period treatment is repeated in the third period of the extra- 
period design. 

Both designs have six cell means for the estimation of design and treatment 
factor effects, and the analysis of variance in Table 16.2 based on the model in 
Equation (16.5) can be used for the sum of squares partitions. An analysis of either 
design provides valid tests for all of the effects in the model. 


Extra-Period Design Superior to the Switchback Design 


Recall from the previous section on balanced designs that direct and carryover 
treatment effects are orthogonal in the extra-period design so that the estimates of 
direct and carryover treatment effects are uncorrelated. The effects are not orthog- 
onal in the switchback design, and the correlation between the direct and carryover 
effects is p = 0.87(07/n). 
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The additional superiority of the extra-period design resides in its greater effi- 
ciency for the estimation of direct and carryover treatment effects. The variances of 
the estimates for direct and carryover effects in the extra-period design are 


2 2 
of = 0.19( =) and o% = 0.25 (=) 
mr n 


whereas the variances for the effects in the switchback design are 


2 2 
018 (=) and o5 pe 1.0() 
n n 


The statistical properties of the extra-period design are superior to those of the 
switchback design since the estimates of the direct and carryover treatment effects 
are uncorrelated and the variances of the estimates are smaller. 

The choice of design to this point has focused on estimation and significance 
tests for the primary design and treatment factors in the crossover design: the se- 
quence, period, direct treatment, and carryover treatment effects. Other designs can 
be constructed to facilitate the estimation of additional effects that are potentially 
important in some studies. These effects include interactions between the direct and 
carryover treatment effects and interactions between the treatment effects and the 
design factors such as periods and sequences. Discussions on designs for the esti- 
mation and tests for these additional model effects can be found in Jones and 
Kenward (1989). 


EXERCISES FOR CHAPTER 16 


1. A pharmaceutical manufacturer conducted a crossover study with an anticonvulsant drug (DPH) 
used in the management of grand mal and psychomotor seizures. A single dose of DPH was given 
to a subject, and the plasma level of the drug was measured 12 hours after the drug was adminis- 
tered. The four treatments were (A) 100 mg generic DPH product in solution, (B) 100 mg 
manufacturer DPH in capsule, (C) 100 mg generic DPH product in capsule, and (D) 300 mg 
manufacturer DPH in capsule. _ 


A 4 x 4 Latin square crossover design was used for the study. Two subjects were assigned to 


each of the four treatment sequences. A single dose of each treatment was administered to each 
subject with an interval of two weeks between treatments. The subject’s plasma levels of DPH are 
shown in the table at the end of the exercise. 


a. 
b. 


Is the design balanced for carryover effects? Explain. 

Write a linear model for the study, explain the terms, and state the assumptions necessary for 
the analysis. 

Compute the analysis of variance for the data, and test the significance of the direct and carry- 
over effects. 

The manufacturer wanted to know if the bioavailability of DPH in the capsule form was as 
high as that for the liquid form, whether the bioavailability of the generic product was 
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equivalent to their product, and whether there was a dosage effect (300 mg versus 100 mg) on 
the bioavailability. Devise a meaningful set of contrasts among the four treatments, test their 
significance, and interpret the results. 

e. Obtain the residual plots for the analysis, and interpret them. 


ence sr Ss lS SSS 


Period 
Sequence Subject I il ITT IV 
ABDC l is LL? 1.8 4.5 
2 1.0 i 1.1 3.3 
BCAD 3 1.9 4.4 1.4 |e 
4 0.9 es | 1.0 0.8 
CDBA 5 32 0.8 0.8 0.9 
6 22 0.8 0.9 0.8 
DACB 7 1.4 £2 3.6 i] 
8 0.9 0.8 2.9 0.8 


ener nnn nn SSS SSS SSS 
Source: K. S. Albert et al. (1974), Bioavailability of diphenylhydantoin. Clinical Pharmacology 
and Therapeutics, 16, 727-735. 


A crossover design was used to compare drugs for the control of hypertension. Two drugs, A and 

B, were used alone and in combination. The combination of the two drugs was labeled as drug C in 

the experiment. Subjects were randomly assigned to one of six sequences of the drug treatments. 

Each treatment period lasted four weeks with no washout period between treatments. The systolic 

blood pressure of the subjects measured at the end of each period is shown in the table at the top of 

the next page. 

a. Is the design balanced for carryover effects? Explain. 

b. Write a linear model for the study, explain the terms, and state the assumptions necessary for 
the analysis. | 

c. Compute the analysis of variance for the data, and test the significance of the direct and 
carryover effects. 

d. Compute the contrasts between drug C (the combination of A and B) and drugs A and B 
separately along with the standard error. Is the systolic blood pressure for the combination of 
the two drugs significantly different from that of the two drugs alone? Do drugs A and B differ 
significantly with respect to systolic blood pressure? 

e. Obtain the residual plots for the analysis, and interpret them. 


A digestion trial with beef steers was conducted in an extra period Latin square crossover design to 
evaluate the effects of low-quality roughage on feed digestion. The low-quality roughages used in 
the trial were (A) cottonseed hull, (B) bermuda straw, and (C) wheat straw, and the high-quality 
roughage used as a control was (D) alfalfa hay. One steer was randomly assigned to each sequence 
of four diets. The steer remained on each diet for 30 days and measurements on dry matter diges- 
tion were made during the last week of the trial allowing a 21-day adjustment to each diet. The 
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Period 
Sequence Subject I II II 
ABC 1 174 146 164 
2 145 125 130 
3 230 174 200 
4 240 130 195 
ACB 5 P92 150 160 
6 194 208 160 
a 175 152 175 
8 202 160 180 
BAC 9 184 192 176 
10 140 150 150 
11 15 230 226 
2 180 185 190 
BCA 13: cf 86 132 138 
14 145 154 166 
> 194 210 190 
16 180 180 190 
CAB ry 206 220 210 
18 160 180 145 
19 188 200 190 
20 185 197 182 
CBA 21 180 180 208 
Ze 210 160 226 
23 185 180 200 
24 190 145 160 


Source: B. J. Jones and M. G. Kenward (1989), Design and Analysis of 
Cross-Over Trials. London: Chapman Hall. 


roughage diet fed in the fourth period was repeated during the fifth period. The data on dry matter 
digestion for each steer in each sequence are shown in the table. 


Period 
Steer I IT Il IV V 
1 75(A) 76(B) 79(C) 81(D) 79(D) 
2 TMC) T3A) . 79@D) TSB) -T7@) 
3 81(D) 79(C) 75(B). 72(A) — 73(A) 
4 76(B) 79(D) 72(A) 76(C) 73(C) 
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b. 


e. 


, 


Is there an advantage to this design over a simple two-period design without using the third 
period as shown in the data table? Explain. 

Write a linear model for the study, explain the terms, and state the assumptions necessary for 
the analysis. 

Compute the analysis of variance for the data, and test the significance of the direct and carry- 
over effects. 

Compute the contrast between drugs A and B along with the standard error. Do drugs A and B 
differ significantly with respect to diastolic blood pressure? 

Obtain the residual plots for the analysis, and interpret them. 


A crossover study was conducted to evaluate four keyboard layouts. Twelve volunteers experi- 


enced in a common keyboard configuration were used in the study. Each subject used the four test 
layouts in sequence. Each subject was randomly assigned to a sequence of layouts. Each layout was 
used for four days in their ordinary data and text entry activities. On the fifth day they were all giv- 
en a common task to perform with their assigned layout, and the number of errors on the task were 
recorded. None of the subjects knew they were being tested on the final day. The number of errors 
recorded for each subject on each layout are shown in the table. 


Period 

Subject 1 II III IV 
Lind TED) ie te IC) 
Que EGA) « Anne GO)c 2348) 
3. OC): A 7) 
dae) CIA) 
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Go WD) ey AD 
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S< 26) 245-7) 40) 
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Is the design balanced for carryover effects? Explain. 

Write a linear model for the study, explain the terms, and state the assumptions necessary for 
the analysis. Do you think an assumption of normal distribution for the observations is valid 
for this study? Explain. 

Compute the analysis of variance for the data, and test the significance of the direct and carry- 
over effects. 

Obtain the residual plots for the analysis, and interpret them. Do you think a transformation of 
the data will improve the analysis? If so, which transformation might be appropriate? 
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16A.1 Appendix: Coding Data Files for Crossover Studies 


Carryover Effects Coded as Factor Levels 


This coding scheme was suggested for the GLM program in SAS by Ratkowsky, 
Alldredge, and Cotton (1990). The correct sum of squares partitions are produced 
also by the MANOVA program in SPSS with this coding scheme. The estimates of 
the adjusted treatment means may differ by some constant amount between pro- 
grams because of their differing computing algorithms. However, estimates of con- 
trasts among the treatment means and standard errors for the contrasts will be 
consistent among the programs. Some programs will not complete the analysis with 
this coding scheme. 

The basic codes for levels of a factor in a data file ordinarily are a sequence of 
integers, 1,2,3,... . A code for the level of each factor is required for each obser- 
vation in the data file. However, the carryover effects do not occur with observa- 
tions in the first period of the crossover study, and there is no “level” to code for 
carryover effects. Some programs will allow a code for the factor even though the 
factor does not occur for the observation. For the examples in this chapter the levels 
of the carryover factor were identified with the levels 0,1,2,...,¢, where the “0” 
code was used for the first-period observations because there were no carryover ef- 
fects present in the first period. The observation was given a code of “1” if treat- 
ment | occurred in the previous period, a code of “2” if treatment 2 occurred in the 
previous period, and so forth. 

The complete coding for the data file used in Example 16.1 included columns 
of codes for the period (P), sequence (S), animal within sequence (A), diet (D), and 
carryover (C ) factors as well as the column of data for the response variable NDF. 


Oe ee CP CUCU UNDFUP US A O~D UC NDF 
Wee ae es ee et) 7 woe 6 A [3 2 
Ss ee ee ee ee ee ee Oe oe ee ee ee 
SS SS Pe Oe Pere: TR NB ee ee ee 
ee ee See ee ee ta ee oe Sa ae eee 
es ee ee eee yee es ee ee ee ee ee ee 
{i 4 sien @ < t (a a eee 7 5 25 4) 
ti 4 pporpy 9 © 54 - i 4 cig 4 8 Ss fe 3S 
1 4°29 1 Geese + 2.935 (51 ee ee ee 
1-5 1 2 fs 4 tS 8 ae ee 
ieee ates ala eet | Saas oe er a9 mS 4 i Oe 
64 2 3 4 pe Gaia: taatay bids Sea Loe es ee 
el ae ee ee Se es RS Sa oe es eee 


Note the coding for the carryover factor C is “0” in the first period (P = 1). 
The carryover factor code in the second period (P = 2) is identical to the diet (D) 
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code for the same steer in the previous period (P = 1). Likewise, the diet codes for 
the second period are those used for the carryover code in the third period. 


Carryover Effects Coded as Covariates 


Milliken and Johnson (1984) introduced the carryover effects into the model as co- 
variates. The method of coding is sometimes referred to as indicator or dummy var- 
lable codes. The coding method is based on the knowledge that a restriction must 
be placed on the solutions for factor effects in the normal equations. One restriction 
on the relationship among the solutions that can be used for the carryover effects is 
Xi oe iy +---+ be = (). The restriction indicates that knowledge of ¢ — 1 effects 
automatically provides the value for the remaining effect. The relationship is 
ay a oe ae ce = 0 with three-carryover treatment effects. Knowledge of the values 
for two-carryover effects automatically provides the value for the third. Therefore, 
se = — a — x and carryover effects have 2 degrees of freedom. 

Only t — 1 covariates are required for the linear model to include parameters 
for carryover effects because of the restrictions on the carryover effects. The model 
for the roughage diet study is used as an example. The model expressed with two 
covariates for carryover effects is 


Yisk = Wt a; +5; +e + Ta + ALA + A2Xe + Ci5K (16A.1) 
f=42: 6 gee 


One method of coding the covariates for the carryover effects in period k is 


1 if d =1 in period (k — 1) 
ATS —1 ifd=23 in period (k — 1) 
0 otherwise 


Thus, X; = 1 when diet A occurs in the previous period and X; = — 1 when diet 
C occurs in the previous period. 
Also, 


1 ifd =2 in period (k — 1) 
X= —1 ifd=3 in period (k — 1) 
0 otherwise 


Thus, X2 = 1 when diet B occurs in the previous period and Xz = — 1 when diet 
C occurs in the previous period. 

The complete coding for the data file for Example 16.1 includes columns of 
codes for the period (P), sequence (S), animal within sequence (A), diet (D), and 
covariates for the carryover effects X,; and Xo, as well as the column of data for the 
response variable NDF. 

There is no carryover effect in the first period (P = 1), and the covariates are 
coded X; = X_ = 0. The observation on diet B in the second period (P = 2) of the 
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first sequence (S = 1) follows diet A from the first period; thus, a carryover effect 
for diet A is required and the covariate codes are X; =1 and X2,=0. The 
observation on diet A in the third period (P = 3) of the second sequence (S = 2) 
follows diet C from the second period; thus, a carryover effect for diet C is required 
with covariate codes X; = —1 and X2 = -1. 

The sum of squares for the two covariates, X, and X92, will be the sum of 
squares for carryover effects adjusted for direct effects. This method for the analy- 
sis can be used with any statistical program that allows the inclusion of covariates 
in the model. Each program will have unique syntax for the representation of added 
covariates in the model. | 


16A.2 Appendix: Treatment Sum of Squares for Balanced 
Designs 


The quantities shown next for manual calculation of the adjusted sums of squares 
for direct and carryover effects in the balanced crossover designs and balanced 
extra-period designs are valid for equal numbers of subjects per treatment se- 
quence. Following the notation of Cochran and Cox (1957) the required formulae 
are 


T; = treatment totals, ¢ =1,2.......,. 

R; = total of observations in the periods immediately following treatment 2 
F;, = total of sequences in which treatment z is the final treatment 

P, = total of period 1 

G = grand total of all observations 


Also, let t = the number of treatments, r = the number of subjects per sequence, 
and m = the number of Latin squares used for the design. 
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Williams Designs: For the Williams designs compute 

T= -t-Dh4+tR, +R+PR,-tG 
for the adjusted sum of squares for direct treatment effects 
af. 
SST (adjusted) = ri? —t— 1+ Ne =D 
Compute 

R, =tT, +R, +t 4+tP, — (t+ 2G 

for the adjusted sum of squares for carryover treatment effects 
ELA 
S'SC (adjusted) = ie BESS 


Compute the usual unadjusted sums of squares for periods, sequences, subjects 
within sequences, and treatments according to the Latin rectangle design in Chapter 
8. The sum of squares for experimental error to complete the sum of squares parti- 
tion for the analysis of variance as shown in Table 16.2 is 


SSE = Total SS —SSS —SSW — SSP — SST (unadjusted) — SSC 


The estimates of differences between the adjusted treatment means can be 


found as | 
a 
MT) emit + 1KE— 2) 


The standard error estimate for the contrast with adjusted means from a balanced 


crossover design is 
40% PAM SE nde) 
APO oily pelo cind teal Yer) 


where JN is the number of replications per treatment. 


Extra-Period Williams Design: Adjustments to the calculations required for the 
extra-period Williams designs are 


T,=¢+)R-K-G 
with the adjusted sum of squares for treatments as 
Ee 


SS T'(adjusted) = mit + I(t +2) 


Compute 


— 


R,=tR,+A-G 
for the adjusted sum of squares for carryover effects as 


a2 


SSC(adjusted) = a 


The estimated difference between two adjusted treatment means is 


— -_ 


foes ar 
mt(t + 2) 


_ 


be TF = 
with standard error estimate 


 BMSE G+) 
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ve Analysis of Covariance - 


The use of additional information in the experimental units as a local control prac- 
tice to reduce the estimate of experimental error is the primary subject of this chap- 
ter. The values for treatment means in a research study may depend on covariates 
that vary among the experimental units and have a significant relationship with the 
primary response variable. The analysis of covariance is used in this chapter to re- 
move the influence of the covariates on treatment comparisons in completely ran- 
domized and complete block designs. 


17.1 Local Control with a Measured Covariate 


550 


A number of local control techniques are used in experiments to control experi- 
mental error variance. Local control practices reduce experimental error variance 


and increase the precision for estimates of treatment means and tests of hypotheses. 


Concomitant variables are often used to select and group units to control experi- 
mental error variation. 

Many concomitant variables, or covariates, can be measured at any time dur- 
ing the course of the experiment, and their influence on the response variable can 
be assessed when analyzing the results. The analysis of covariance, combining re- 
gression methodology with the analysis of variance, evaluates the influence of the 
covariate on the response variable and enables the comparison of treatments on a 
common basis relative to the values of the covariates. 

Often, many factors external to the treatment factors influence the response 
variable. Blocking on the basis of these influential factors is one of the primary 
means used by researchers to control experimental error. When blocks of units are 
constructed with similar values for the factors, the experimental treatments can be 
compared with one another in a more homogeneous environment. 
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Frequently, the experimental setting ray prohibit blocking of like units for a 
variety of reasons. There may be incomplete knowledge about the experimental 
material or the effects of external factors may not appear until after the experiment 
has begun. Too few units of like value may exist for adequate blocking. Even 
though the investigator may have knowledge of the influential factors, the number 
of additional factors may prohibit the use of all of them as blocking criteria. 

Typical studies with additional variables that influence treatment comparisons 
include: 


e Clinical trials where age, weight, sex, previous medical history, or occupa- 
tion of patients can influence their response to the treatments. Although the 
investigators can block on two or three of these factors, it is not possible to 
ignore the influence of the remaining variables. 


e Trials with fruit trees, which are blocked on the basis of soil or irrigation 
gradients, but the influence of previous treatments and historical yield rec- 
ords cannot be ignored. 


e Feeding trials where blocks of animals can be constructed on the basis of 
litter and initial weights, but the amount of feed consumed during the 
course of the study will also have influence on the measured weight gains. 


The study described in Example 17.1 illustrates an experiment in which the re- 
sponse variable is affected not only by the treatment applied to the subjects of the 
study but also by a covariate that was measured on each subject prior to the study. 


Example 17.1 Effects of Exercise on Oxygen Ventilation 


A common clinical method to evaluate an individual’s cardiovascular capac- 
ity is through treadmill exercise testing. = = thes measures Somnee eb 
treadmill testing, maximal oxygen uptake, is consi 

work capacity and maximal cardiovascular function. 

‘The measured maximal oxygen Tanke ie an ‘individual depends on a 
number of factors, including the mode of testing, test protocol, and the sub- 
jects’ physical condition and age. A common test protocol on the treadmill is 
the inclined protocol where grade and speed incrementally increase until ex- 
haustion occurs. 


Research Hypothesis: Researchers in exercise physiology were of the opin- 
ion that the conditions for treadmill testing should simulate as closely as pos- 
sible the mode of subject gS tle to attain their maximal oxy- 
gen uptake during the test. It was hype that step aerobic training was 
better simulated by the inclinec od F otc at ’ errain running regimen. 


Treatment Design: Two treatments scheited for the study were a 12-week 
step aerobic training program and a 12-week outdoor running regimen on flat 
terrain. The subjects were to be tested on a treadmill before and after the 12- 
week training period with the inclined protocol. If the hypothesis were true, 
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the subjects from the step aerobic training would show greater increases in 
maximal oxygen uptake than the subjects trained only on flat terrain. Those 
trained on flat terrain might be limited by localized muscle fatigue when 
tested on inclined protocols and thus not attain maximal oxygen uptake 
before exhaustion. 


Experiment Design: The subjects were 12 healthy males who did not partici- 
pate in a regular exercise program. a x men were randomly assigned to each. 
group in a completely randomized design. Various respiratory measurements 
were made on the subjects while on the treadmill before the 12-week period. 
There were no differences in the respiratory measurements of the two groups 
of subjects prior to treatment. 

The measurement of interest for this example is the change in maximal 
ventilation (liters/minute) of oxygen for the 12-week period. The observa- 
tions on the 12 subjects and their ages are shown in Table 17.1, along with 
the group means and standard errors. 


Table 17.1 Maximal ventilation change (liters/minute) following a 12-week exer- 
cise program 


Group Age Change Group Age Change 
Aerobic a] 17.05 Running 23 — 0.87 
23 4.96 22 — 10.74 

aT 10.40 22 ee 

28 11.05 25 — 1.97 

22 0.26 Zi 7.50 

24 2.51 20 — 7,25 

Mean 25.83 rere Pee S| — 2.77 
Std. Err. 1.40 2355 1.01 2.54 


Source: D. Allen, Exercise Physiology, University of Arizona. 


Is Treadmill Performance Related to Age? 


A graph of the change in maximal ventilation (y) for each of the subjects with their 
age (x) on the horizontal axis is shown in Figure 17.1. The plot suggests a 
strong positive relationship between the age of the subjects and their change in 
maximal ventilation on the treadmill regardless of the treatment group to which 
they belong. Thus, there appears to be considerable experimental error variation 
within each group associated with age differences. 

The study protocol required the eligible subjects to be healthy males between 
the ages of 20 and 35 with a sedentary lifestyle. Although the two groups of sub- 
jects did not differ in their average pretest maximal ventilation measures, a one-way 
aaalesis of variance reveals a significant difference between the two groups in the 

nal ventilation change after training. The aerobic exercise group had a larger 
elitr’ in ventilation rate than the running group, but the aerobic group consists of 
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Figure 17.1 The relationship between maximal ventilation and age in the exercise 
physiology study 


an older group of males. It must be determined whether the difference is a result of 
the exercise or the age differences in the groups. That is, suppose the mean changes 
in maximal ventilation were compared at the same age for both groups. Would the 
aerobic exercise group mean still be significantly greater than the running group 
mean? The analysis of covariance can be used to help answer that question and to 
determine whether the relationship between maximal ventilation change and age 
contributes significantly to experimental error variation. 


17.2. Analysis of Covariance for Completely Randomized 
Designs 


The Linear Model and Analysis of Covariance 


The experiment on exercise physiology was conducted in a completely randomized 
design with two treatment groups. Assuming a linear relationship between the re- 
sponse variable y and a covariate x the linear model for the completely randomized 
design is 7 


Ue SP a — EY ei; (17.1) 
f=12...4 = 1,2,....¢ | 


where ju; is the treatment mean, @ is the coefficient for the linear regression of yj; 
on 2x;;, and the e;; are independent, normally distributed random experimental 
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errors with mean 0 and variance o*. Two additional key assumptions for this model 


are that the regression coefficient @ is the sam nt_ groups, and the 


treatments do not influence the covariate zx. 

The first objective of the covariance analysis is to determine whether the addi- 
tion of the covariate has reduced the estimate of experimental error variance. If the 
reduction is significant, then we obtain estimates of the treatment group means 7,, 
adjusted to the same value of the covariate x for each of the treatment groups and 
determine the significance of treatment differences on the basis of the adjusted 
treatment means. 


Alternative Models to Evaluate the Covariate Contribution 
The analysis will require least squares estimates of the parameters for the alterna- 
tive full and reduced models, which are 

e the full model Yij = i + B( xi; — @.)+ Cij 

e areduced model without the covariate y;; = 4; + e;; 

e areduced model without treatment effects y;; = take Mate 2) + 6; 

The reduced model without the covariate is required to assess the influence of 
the covariate, and the reduced model without the treatment effects is required to as- 
sess the significance of treatment effects in the presence of the covariate. 


Least squares estimates of the parameters are derived for the full model to 
obtain 


SSE; = Xlyis — B; — Bey — BP 
with NV — t — 1 degrees of freedom; the reduced model without the covariate for 
SSE, = & (yis — fi)? 


with NV —t degrees of freedom; and the reduced model without treatment effects 
for | 


SSE, =u a 2 Bais a Z| 


with NV — 2 degrees of freedom. 
The sum of squares reduction due to the addition of the covariate x to the mod- 
el is obtained as the difference 


SS(Covariate) = SSE, — SSE; 


with 1 degree of freedom. The adjusted treatment sum of squares after fitting the 
covariate is 
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SST (adjusted) = SSE* —SSE; 


with t — 1 degrees of freedom. 
The SSE for each of the three models fit to the exercise physiology data 
(Table 17.1) are SSE; = 70.39, SSE, = 389.30, and SSEF = 142.18. 


Sum of Squares Partitions for the Analysis of Covariance 


These sum of squares reductions can be computed by most computer programs 
for the analysis of covariance, and they will produce the required information found 
in the following discussion. The sum of squares partitions for the change in maxi- 
mal ventilation rate with the age covariate in the exercise physiology study is 
shown in the analysis of variance in Table 17.2. Notice without the age covariate 
the estimated experimental error variance is MSE,=SSE,/(N —t)= 
389.3/10 = 38.93. The addition of the covariate has reduced the estimate to 
MSE =7.82 in Table 17.2. Thus, use of the age covariate as local control to re- 
duce o” through covariates appears to be effective. The gain in efficiency due to the 
covariate is illustrated later in this section. 


Table 17.2 Analysis of covariance for maximal ventilation change with age co- 
variate in an exercise physiology study 


Source of Degreesof Sumof Mean 

Variation Freedom Squares Square F i we et od 
Regression 318.91 318.91" 40.78 .000 
Group l 71.79 71.79% 9.18 014 
Error 9 70.39 7.82? 


*MS(Covariate) ‘MST(adjusted) *A SEs 


Tests of Hypotheses About Covariates and Treatments 

Determine the significance of the reduction due to the covariate with a test of 
the null hypothesis Ho: 3 = 0. The test statistic is 
_ MS(Covariate) 318.91 
oe oe ee 


Fo = 40.78 (17.2) 
with critical value F'95,;.9 = 5.12. The null hypothesis is rejected with Pr > F' = 
.000 in Table 17.2; the addition of the covariate has significantly reduced experi- 
mental error variance. 

The significant relationship between change in maximal ventilation rate and 
age of the subjects indicates the necessity to assess the significance of the treatment 
effects after the covariance adjustment. The null hypothesis for the adjusted treat- 
ment means is Ho: 1 = [2, and the test statistic is 
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_ MST adjusted) — 71.79 


Fo MSE Bias 9. 


= 9.18 (17.3) 


with critical value F'95,1.9 = 5.12. The null hypothesis is rejected with Pr > F = 
0.014 in Table 17.2, and we conclude that treatment means adjusted for age are 
different. 


Treatment Means Adjusted to a Common Covariate Value 


The estimates of the treatment means are adjusted to a common value for the covar- 
iate if inclusion of the covariate in the model significantly reduced experimental er- 
ror variance. The treatment means can be adjusted to any value of the covariate, but 
ordinarily they are adjusted to the overall mean Z_ as 


Via = Ui, — Bi. — F.) (17.4) 


The estimate of the regression coefficient, which most programs compute auto- 
matically, is 


t r. : 
_ * 2, is = Be Mths. ,.) EON 
gels aes = a = 1.886 (17.5) 
pe ee 


1 j=1 


». 

lI 
a) 

II 


and the estimated regression equation for the ith treatment group will be 


9; = i, + Bai — Z) (17.6) 


and are 


91; = 7.70 + 1.886(21; — 25.83) 


for the aerobic group and 


Dog = — 2.77 + 1.886(22; — 23.17) 


for the running group. The regression line for each of the treatment groups is shown 
in Figure 17.2. 
The treatment means adjusted to the mean age Z. = 24.5 are 


Vig = 7.70 — 1.886(25.83 — 24.5) = 5.19 
Yo. = — 2.77 — 1.886(23.17 — 14.5) = — 0.26 


The unadjusted and adjusted treatment means are shown with the computed regres- 
sion line for each of the treatment groups in Figure 17.2. 
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Figure 17.2 Regression between maximal ventilation and age in the exercise 
physiology study with adjusted treatment means 


The difference between the unadjusted treatment means y, —Yy, = 
7.71 — (— 2.77) = 10.48 was much greater than the difference between the treat- 
ment means adjusted to a common age, Y,4 — Yo, = 5.19 — (— 0.26) = 5.45. Part 
of the difference between the unadjusted means was due to the difference of over 
two years in the average ages of the subjects in the two treatment 
groups, Z;, = 25.83 and Z2, = 23.17. The adjusted means are estimates of the mean 
maximal ventilation change at a common age. Thus, the difference between the ad- 
justed means reflects only the effect of exercise on the change in maximal ventila- 
tion rate. 


Standard Errors for the Adjusted Treatment Means 


Two quantities useful for calculation of standard errors and relative efficiency 
in the analysis of covariance are the sums of squares for treatments, T;,, and ex- 
perimental error, /,,, from an analysis of variance for the age covariate, x. These 
sums of squares and their values for this study are 


t 


Ti: = aa —~z¢)y=21.31 
3 | 


and 


t 
Eve =))) (xij — Zi)? = 89.67 
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The standard error estimator for an adjusted treatment mean is 


sy, = (17.7) 


| Ee eit oom 


The estimated standard error for the adjusted aerobic exercise group mean is 


78) l a (25.83 — 24.50)? 12] 
Soa = — + ——_____—__ | = ]]. 
— 6 89.67 


The standard error estimate for the adjusted running group mean will be the same 
as that for the exercise group mean since the quantity (;, — Z_)” in Equation (17.7) 
is the same for both treatment groups. 

The standard error estimator for the difference between two adjusted treatment 


means is not always available from computer programs without some clever pro- 
gramming on the part of the user. It is calculated as 


(17.8) 


The difference between the adjusted treatment means for the aerobic exercise and 
running groups iS J, — You = 5.19 — (— 0.26) = 5.45 with standard error estimate 


POY" 2583-17 


Bin ie) Seg Oks = $30 
5 (Tia Va) E 6 89.67 | 


Even when all 7; are equal, the standard error of the difference will vary among the 
pairs of treatments with more than two treatment groups because of the term 
(Z;, — Z;,) in Equation (17.8). In practice the variation in the estimate is slight. A 
single average standard error of the difference suggested by Finney (1946) to sim- 
plify the analysis with equal replication numbers is 


aaa 17. 
Sia Via) r = (t <= =| ( 9) 


The substitution of T,, for(%;. — %..)* in Equation (17.7) provides the average 
standard error for the adjusted treatment means. 


Was There Increased Efficiency with a Covariate? 


Whether the covariance adjustment has been worth the required effort depends on 
the gain in efficiency of the estimated means. The efficiency of the covariance ad- 
justment relative to the analysis without the adjustment is based on the ratio of re- 
spective variances for the estimates of treatment means. The estimate of 
experimental error variance from the reduced model without the covariate is 
MSE, = SSE,/(N — t) = 389.3/10 = 38.93. The average variance suggested by 


Ftaoc | 
jolevactinn? 


(Br the Regression Coefficient the Same for All Treatments? 
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Finney (1946) can be used for the estimate with the covariance adjustment. The 
efficiency is calculated as 
MSE, 


s 
MSE |I+ ah, | 


E= (17.10) 


Gwen MS &. 338.93. 7... = 21.33, h.; = 89.67, MSE = 782, and t =2 the 
efficiency of the covariance adjustment is EF = 38.93/9.68 = 4.0. Thus, without 
covariance adjustment for age, four times as many subjects would be required for 
the exercise study to achieve the same precision on the estimated treatment means. 


Critical Assumptions for a Valid Covariance Analysis 


The validity of inferences from the analysis of variance requires an assumption of 
independent and homogeneous normally distributed errors. An evaluation of the as- 
sumptions regarding homogeneous and normally distributed experimental errors 
can be achieved with an estimate of the residuals for each observation as 
yj 3 - F,. = B(x; — %;.). Many computer programs will supply estimates of 
the residuals that can be used for the normal plots and plots of residuals versus esti- 
mated values to evaluate the assumptions as outlined in Chapter 4. 

Additional assumptions critical to the validity of inferences from thefanalysis 
of covariance are ne ® the covariate x is unaffected by the treatments, (2) a linear 

exists between the response variable y and the covariate x, and (3) the 
regression c coef feient Bi is the same for all treatment groups. 


fit YoyX 
roo Xe 


(Vo the Treatments Affect the Covariate? 


If the covariate x as well as the primary response variable y is affected by the 
treatments the resultant response is multivariate, and the covariance adjustment for 
treatment means is inappropriate. In these cases, an analysis of the bivariate re- 
sponse (z, y) utilizing multivariate analysis methods is in order. Adjustment for the 
covariate is appropriate if it is measured prior to treatment administration since the 
treatments have not yet had the opportunity to “arteeE its value. If the covariate is 
measured concurrently with the response variable, then it must be decided whether 
it could be affected by the treatments before the covariance adjustments are 
considered. 


yng adj reatment means are independent of the covariate 
value if the regression fines ‘ca the ate: groups are parallel. If the relationship 
between y and «x differs among the treatment groups as shown in Figure 17.3, then 
differences between adjusted treatment means depend crucially on the level of x 
chosen for the adjustment. 

This heterogeneity of regression coefficients among treatment groups 
resembles interaction between factors in the standard factorial treatment design. In 


fit Vy X 


Fit ey ¥ ——€ 


Y- Poflaey X- Covariak 
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Figure 17.3 Differing regression relationships between the response variable y 
and the covariate x 


that case comparisons are made with simple effects of one factor at each level of 
other factors. A similar strategy must be used in the case of continuous covariates 
with different regression coefficients for the treatment groups. 
Similar difficulties occur with nonlinear relationships. Under these circum- 
stances the inferences regarding the responses must include a complete description 
nw seinwalving the effects of the treatments and the covariate. 


@ 


Evaluation of the Separate Regressions Model 


The Linear Model with Separate Regressions for Each Treatment 


The linear model with different regression coefficients for each of the treat- 
ment groups is 


U5 SOF Pie HS TL) ee (17.11) 
i= bh Qee + 3557 ee 


ren, Uwhere (; 1s the regression coefficient for the zth treatment’ group. A test for a 
equality of the regression coefficients, Ho: 3; = Go =--- =, can be: -derived . 
from an analysis of the two alternative models, the Hg with a common regres- 
sion for all treatments and the model with separate regressions for the treatments. 
Different regressions imply the presence of an interaction between the treatments 
and the covariate; thus, the model alternatively can be written to include a term for 
interaction between the covariate and treatments. 
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Example 17.2 Auditory Discrimination and Cultural Environment 


Hendrix, Carter, and Scott (1982) described a study conducted to determine 
the difference in the ability to discriminate aurally between environmental 
sounds with respect to several factors. The study was designed to test the ef- 
fects of a treatment on auditory discrimination. Subjects belonging to two dif- 
ferent cultural groups were given pre-treatment and post-treatment tests on 
auditory discrimination. 

Only a portion of the data is used for this example to illustrate the effects 
of heterogeneous regression coefficients for the covariate. The analysis will 
be conducted to determine whether the gain in auditory score between the 
pre- and post-treatment administration differed between the subjects with pre- 
test scores as a covariate. The data for gain in scores and pre-test scores from 
female subjects in the auditory treatment group are shown in Table 17.3. 


Table 17.3 Pre-test and gain scores for auditory discrimination in two cultural 
groups 


Culture Pre-Test Gain Culture Pre-Test Gain 

| 64 7 2 52 Z 
39 Re 50 12 

69 2 59 6 

56 20 42 10 

67 4 62 | 

39 26 | i 9 

a2 34 4] 6 

62 8 36 8 

64 4 31 8 

66 2 64 6 

Mean _ Se 13.6 47.8 6.8 


A Sum of Squares Partition for Homogeneity of Regressions 


Let the model y;; = yw; + (aij — &.) +e; im Equation (17.1), assuming a 
common slope, be model 1 with the usual t + 1 parameters, ; and 3. Let the mod- 
el yi = Wi + Bi(xij — Xi.) + ej; in Equation (17.11), assuming different slopes for 
each treatment, be model 2 with 2¢ parameters, 4; and (;. The sum of squares for 
experimental error from model 1, say SSE, will have (N —t-— 1) degrees of 
freedom, and the sum of squares for experimental error from model 2, say SS E4, 
will have (NV — 2t) degrees of freedom. 

The analysis of variance of the auditory discrimination data for model 1 in 
Table 17.4 contains SSF, = 519.38 with 17 degrees of freedom. The analysis of 
variance for model 2 is shown in Table 17.5 with 2 degrees of freedom for the sum 
of squares reduction for separate regressions calculated within each culture group. 
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The error sum of squares has been reduced to SS EK. = 147.22 with 16 degrees of 
freedom. 


Table 17.4 Analysis of covariance for score gain with pre-test score covariate in 
the auditory discrimination study 


Source of Degreesof Sumof Mean 
Variation Freedom Squares Square F TSS 
Regression ] 1061.12 1061.12 34.73 .000 
Group 647.47 647.47 21.19 .000 
Error 17 51938" 30.55 

“SSE; 


Table 17.5 Analysis of covariance score gain assuming separate regressions of 
gain on pre-test scores for each cultural group 


Source of Degrees of Sumof Mean : 
Variation Freedom Squares Square F a eee a 
Group l 610.24 610.24 66.33 .000 
Regression within groups 2 1433.28 716.64 77.89 000 
Error 16 147,.22* 9.20 

“SSE 


The sum of squares to test homogeneity of regression coefficients for the treat- 
ment groups is the difference in the experimental error sum of squares for the two 
models or 


SS(Homogeneity) = SSF, — SSE» (17.12) 
= 519.38 — 147.22 
== 372.76 


with (V —t—1)—(N — 2t) = (t— 1) or 17 — 16 = 1 degrees of freedom. The Fo 
statistic to test the null hypothesis of equal regression coefficients, Ho: 
Bi = Be =~ = Bris 

_ _ MS(Homogeneity) 


fo = MSE, (17.13) 


with critical value Fy (+-1)(Nw-2:). The test for the exercise study is fo = 
(372.16/1)/9.20 = 40.45 exceeds F'o5,1,16 = 4.49, and the null hypothesis of equal 
regression coefficients for the two cultural groups is rejected. The regression of 
gain in score on pre-test scores is different for the two cultural groups, and adjust- 
ment of the cultural group means to the same pre-test score with a common regres- 
sion is not appropriate. 
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Many statistical programs are capable of directly computing the required sums 
of squares for homogeneity of regression coefficients in Equation (17.12) when 
specified as a treatment by covariate interaction effect in the program. The analysis 
of variance for the covariance model with a treatment by covariate interaction term 
is Shown in Table 17.6. Note the sum of squares for the Groups x Regression inter- 
action is equivalent to the sum of squares for homogeneity, Equation (17.12), de- 
rived from the alternative models with common and different regressions for the 
treatment groups. 


Table 17.6 Analysis of covariance for score gain, including the interaction be- 
tween the pre-test score covariate and cultural groups 


Source of Degrees of Sum of Mean 

Variation Freedom Squares Square F rrr 
Group l 610.24 610.24 66.32 .000 
Regression l 770.63 770.63 83.75 .000 
Groups x Regression l 37216. - 37246 40.45 .000 
Error 16 147.22 9.20 


A Regression Coefficient Estimate for Each Treatment 


If the regressions are significantly different among the treatment groups the least 
squares estimate of the regression coefficient for the zth treatment group is com- 
puted from the z;; and y;; values within the treatment group as 


he Big — TYG - Fi) 


The estimated regression equation for the 7th treatment group is ¥; = 
Y,; + B (xi; — %;,). The estimates of the regression coefficients for the two cultural 


groups and their standard errors (in parentheses) are 2 = — 0.91(0.07) and S. = 
— 0.16(0.09). Both of these estimates are available from computer programs for 
linear models. The standard error estimate for the estimated regression coeffi- 


cient, Bs is the square root of MS E/ Ea (x4; — %;,)’. 
The estimated regression equations for the two cultural groups are 


Group |<. 8, = 35.8.— 0.91 — 13.6) 


Group2: J = 47.8 —0.16(z — 6.8) 


The regression equations are plotted in Figure 17.4 along with the observations for 
each cultural group. Clearly, the regression lines are different from one another and 
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a comparison of cultural group means adjusted to the overall pre-test mean of 
x = 51.8 would be relatively meaningless. 


Gain in score 


40 50 60 70 
Pre-test score 


Figure 17.4 Regression between gain in score and pre-test scores with separate 
regressions for cultural groups 


Rather, consider the regression lines as estimates of the simple effect of pre- 
test scores at each level of cultural group and make inferences on that basis. In both 
cultural groups the gain in score decreased with higher pre-test scores, but the rate 
of decrease was much greater with Group | than with Group 2. In fact a simple Stu- 
dent ¢ test for Ho : G2 =0 is tp = — 0.16/0.09 = — 1.78, which would lead to 
nonrejection of the null hypothesis; so no convincing evidence exists to indicate 
gain changes with pre-test scores in Group 2. 

It is evident that the gain in scores from treatment in Group 1 are much greater 
when the subjects had low initial auditory discrimination and appear to benefit 
much more from training than do those in Group 2 with similar low initial auditory 
discrimination. When subjects from either group had high initial auditory 
discrimination on the pre-test the gain from training was relatively low or 
negligible. 

Confidence intervals or tests of hypotheses between the estimated group means 
for any given value of pretest score x = x0, Say , Y1),, — Yojz,, can be constructed 
with the variance of the contrast as 

ee (ip — 2,)° (a 2) 


gas °.)) Se SE oe se ei 17.15 
ot Ti otis T Otek Dlg i. Mint ees — 2a)" ( ) 
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17.3. The Analysis of Covariance for Blocked Experiment 
Designs 


The analysis of covariance can be applied to any experimental design with a 
straightforward extension of the principles applied to the an ied pescado 
design in the previous section. Howeve , a test for equality of re ion coeffi- © 
oy am) ong reatn ps is not possible with blocked de: igns unless there is 


eacl ti ca it within each block. The analysis 


is ‘Mustrated with a a complete block design. 


Example 17.3. Nutrient Availability Tests with Barley 
in the Greenhouse : 


Management methods on forest and range watersheds affect the nutrient sta- 
tus and availability of nutrients in any vegetation and soil-type complex. 
Knowledge of the soil, plant, and nutrient relationships is essential to proper- 
ly manage watershed vegetation and soils. 

The availability of certain soil nutrients in these watershed soils is evalu- 
ated by a pot culture technique in a greenhouse with barley plants. In princi- 
ple, the method is based on the law of limiting factors. The test plants are 
grown in the soil fertilized to an optimum level and in the soil fertilized in 
identical fashion but without the nutrient in question. If the nutrient is defi- 
cient in the soil, the plants cultured in the complete nutrient soil will exhibit 
more plant growth than those cultured in the soil with the nutrient omitted 
from the fertilizer. 


Research Objective: In one such study an investigator wanted to determine 
the availability of nitrogen and phosphorus in a watershed dominated by 
chaparral vegetation. He had collected soil samples from under the canopies 
of the dominant vegetation in the watershed, mountain mahogany, and com- 
posited the samples for a pot culture evaluation of nitrogen and phosphorus 
availability. 


Treatment Design: Four nutrient treatments used for the study were (1) 
check, no fertilizer added; (2) full, a complete fertilizer; (3) No, nitrogen 
omitted from full, and (4) Po, phosphorus omitted from full. The nutrient 
treatments were added as solutions to the soil, mixed, and placed in plastic 
pots in the greenhouse. | 


Experimental Design: The treatment pots were arranged on a greenhouse 
bench in arandomized complete block design to control experimental error 
variation caused by gradients in light and temperature in the greenhouse. 

The barley plants were grown in the pots for seven weeks when plants 
were harvested, dried, and weighed. A leaf blight infected the plants part way 
through the experiment. It was assumed the blight would affect the growth of 
the plants and at the end of the experiment the percentage of the leaf area 
affected by the blight was measured in each container before the barley plants 
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were harvested. The total dry weight of the barley plants and the percent leaf 
area affected with the blight is shown in Table 17.7 for each container in the — 
experiment. 


Table 17.7 Total dry matter y in grams and percent blighted leaf area x of barley 
plants 


Treatment 
Check Full No Po 
Block y x y x y z y x 

1 25.1 13 30.1 7 26.4" 16 20.2 8 

z 20.9 12 31.8 5 (5 ee pas ge 9 

3 28.3 7 32.4 6 280% 6 29.7 7 

4 25.0 9 30.6 7 782% 5 26.0 if: 

5 25.1 8 21:5 9 50.5 FF 24.9 9 
Mean 24.48 9.8 30.48 6.8 26.5: fa 26.42 8.0 
Source: Dr. J. Klemmedson, Renewable Natural Resources, University of Arizona. 

The Linear Model for a Randomized Complete Block Design 
The effects model for the randomized complete block design can be expressed as 
, Vij = yt py + P@yg — T+ eg (17.15) 


(21 Fe 1,2,..28 


where pz is the general mean, 7; is the treatment effect, p; is the block effect, ( is 
the regression of y on z, and the e;; are independent and normally distributed ran- 
dom errors with mean 0 and variance o?. It is further assumed that the covariate is 
unaffected by the treatments or blocks and the regression is the same for all 
treatments. 


Alternative Models to Evaluate the Covariate Influence 


Alternative full and reduced models are used to evaluate the influence of the covar- 
iate and also the significance of the treatment effects after adjustment for the co- 
variate if necessary. The required models and their experimental error sums of 
squares are 


e the full model, y;;='v +7, + pj + Ci; — 7.) +e, and SSE; with 
(r — 1)(t — 1) — 1 degrees of freedom 


e the usual randomized complete block model with no covariate, y;; = 
++ pj +e;,and SSE, with (r — 1)(¢ — 1) degrees of freedom 
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e no treatment effects, y;; = u~+p; + B(x; — 7) +e;, and SSE* with 
(N — r — 1) degrees of freedom 


Finally, although not entirely necessary for the analysis, the adjusted sum of 
squares for blocks can be computed using the model with 


e no. block effects, y,,= +7, + P(%j;- 7)+¢,;, and SSE** with 
(NV —t — 1) degrees of freedom 


The SSE for each of the models fit to the barley data, Table 17.7, are 
SSE; = 12.577, SSE, = 39.167, SSE* = 37.437, and SSE™ = 20.363. 


Sum of Squares Partitions for the Analysis of Covariance 


The sum of squares reduction upon adding the covariate x to the usual randomized 
complete block model is obtained as the difference 
S'S(Covariate) = SSE, — SS Ey 


= 39.167 — 12.577 = 26.590 


with | degree of freedom. The adjusted treatment sum of squares after fitting the 
covariate and block effects is 
SST (adjusted) = SSE, — SSE; 


= 37.437 — 12.577 = 24.860 


with (t — 1) =3 degrees of freedom. The adjusted block sum of squares after fit- 
ting the covariate and treatment effects is 


S'S B(adjusted) = SSE** — SSE; 


= 20.363 — 12.577 = 7.786 


The analysis of covariance for the nutrient availability test with barley is shown in 
Table 17.8. 

The significance of the covariate requires a test of the null hypothesis Ap: 
( = 0 with the statistic 
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Table 17.8 Analysis of covariance for dry matter production of barley plants with 
percent blight damaged leaf area as a covariate 


Source of _Degreesof Sum of Mean 

Variation = Freedom Squares Square - as gee od 
Regression ] 26.590 26.590 23.263 0.001 
Block 4 7.786 1.947 1.703 0.219 
Treatment 2 24.860 8.287 7.250 0.006 
Error 11 12577 1.143 


MS RSS 32" 5 MOLE See Pas 2 G35 55 ie eis) Be RE ee epee |S Ge ee eee eee 
_ MS\(Covariate) 20.998 


Fo MSE 1.143 


= 23.263 


which is significant with Pr > F = .001 in Table 17.8. Thus, the relationship be- 
tween the percent blight damaged leaf area and dry matter production of the barley 
plants is significant. 

The estimate of the regression coefficient, which will be provided by most 


computer programs, is 3 = — 0.863. The negative coefficient indicates the dry 
matter production decreases with an increase in the incidence of the disease on the 
plant. 


The null hypothesis of no differences among the treatment means is tested with 
the statistic 
_ MST (adjusted) 8.287 


ae MSE “fe 


and the null hypothesis is rejected with Pr > F = .006 in Table 17.8. 


Adjusted Treatment Means and Their Standard Errors 


The adjusted treatment means are calculated the same as for the completely ran- 
domized design with Equation (17.4) and are shown in Table 17.9 along with their 
standard errors. The standard errors shown in Equations (17.7) and (17.9) can be 
used for the adjusted treatment means with the sums of squares for treat- 
ments, J7, = 26.55, and error, K,, = 35.70, from the analysis of variance for the 
covariate, blight infection, using the data in Table 17.7. The estimated average stan- 
dard error of the difference between two adjusted treatment means is 


— eT tes 
5 (Via—T ja) en 5 (3)35.7 


| = 0.76 (17.16) 


Interpretations with Multiple Contrasts 


The one-sided Dunnett 95% simultaneous confidence with the “Full” treatment as a 
control can be used to determine whether the soil was deficient in nutrients. The 
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Table 17.9 Adjusted means and their standard errors from the analysis of covari- 
ance for dry matter production of barley plants with percent blight damaged leaf 
area as a covariate 


Adjusted Standard 
Treatment Mean Error 
Check 26.08 0.58 
Full 29.49 0.52 
No 27.65 0.50 
Po 26.46 0.48 


soil is deficient if the plant grown in the control or Full treatment exceeds that for 
any treatments deficient in one or more of the nutrients; therefore, if the upper 
bound of the interval for treatment minus control is negative the treatment is defi- 
cient in nutrients. 

From Appendix Table VI the critical value of the Dunnett statistic for a one- 
sided interval is do5,3.11 = 2.31. The Dunnett criterion is D3, .05) = 2.31(0.76) = 
1.76. The 95% SCI upper bounds for differences between the treatments and the 
control, Full, treatment are shown in Table 17.10. 


Table 17.10 Upper bounds 95% simultaneous confidence intervals using the 
Dunnett method to compare the control treatment, Full, to other treatments 


Adjusted 95% SCI 
Treatment Mean (y;) Gi. = 7.) Upper Bounds 
Check 26.08 — 3.4] — 1.65 
Full ¥. = 29.49 ~ _ 
No 27.65 — 1.84 — 0.08 
Po 26.46 — 3.03 — 1.27 


The difference between the Check treatment with no added nutrients and the 
control treatment is 26.08 — 29.29 = — 3.41 with an upper bound, — 1.65, on the 
one-sided confidence interval. Thus, the soil is deficient in some unspecified nutri- 
ents. Specific tests for deficiencies in nitrogen and phosphorus require the compari- 
sons between the full treatment and the No and Potreatments. The difference 
between the nitrogen deficient treatment No and the control treatment is 
27.65 — 29.49 = — 1.84 with an upper bound of — 0.08. The difference between 
the phosphorus deficient treatment Pp and the control is 26.46 — 29.49 = — 3.03 
with an upper bound of — 1.27. Although the soil was deficient in both nitrogen 
and phosphorous, the upper bound for the phosphorous comparison was much far- 
ther removed from 0 than that for the nitrogen comparison, indicating a greater 
deficiency in phosphorous than in nitrogen. 
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17.4 Practical Consequences of Covariance Analysis 


Practical application of the analysis of covariance has been demonstrated only with 
completely randomized and randomized complete block designs. However, the use 
of covariates can be extended to any treatment and experiment design as well as to 
comparative observational studies of complex structure and studies requiring the 
use of multiple covariates for adjustment. The objective in this chapter was to intro- 
duce the basic ideas underlying the use of additional information on basic units of 
the study. The specific manual formulae for the analysis will depend on the specific 
treatment and experiment design employed for the study. However, in all cases the 
use of full and reduced models, as illustrated with the two designs in this chapter, 
will enable an assessment of the covariates influence on the reduction of experi- 
mental error and the significance of adjusted treatment means. 

Extensive discussions on the uses and misuses of covariates in research studies 
were provided in two special issues of Biometrics (1957), Volume 13, No. 3 and 
(1982), Volume 38, No. 3. Of particular interest are articles by Cochran (1957), 
Smith (1957), and Cox and McCullagh (1982). A number of issues arise relevant to 
the use of covariates. Among those concerns are the applicability in certain situa- 
tions and the relationship between blocking and covariates. 


Analysis of Covariance Combines the Features of Two Models 


The analysis of covariance combines the features of models for the analysis of vari- 
ance and regression to partition the total variation into components ascribable to (1) 
the treatment effects, (2) the effects attributable to any covariates, and (3) random 
experimental error as well as the variation associated with any design blocking fac- 
tors. The basic intention is to compare treatments at a common value for the 
covariate. 


When Covariates Are Superior to Blocks for Error Control 


On the surface, covariance analysis appears to offer an alternative to blocking for 
reducing experimental error. Blocking designs restrict the number of criteria that 
reasonably can be used for local control. On the other hand, covariance permits the 
use of any number of factors thought necessary. Covariance also makes better use 
of exact values for quantitative factors, whereas blocking groups the same fac- 
tors into classes of values. The advantage of covariates seems great when there 
are more than a few factors available as potential candidates for blocking variables. 


When Blocks Are Superior to Covariates for Error Control 


Covariance may be at a distinct disadvantage without blocking because random 
allocation of treatments to experimental units can result in an uneven distribution of 
treatments among the covariate classes. Any association between the covariate and 
the treatments confounds the effects of both on the response variable. Blocking on 
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the values of the covariates distributes the treatments evenly among the covariate 
classes and avoids confounding the effects of the treatments and covariates. 

Blocking is most effective with qualitative factors that produce recognizable 
variation among the experimental units. These factors include study management 
practices when tasks have to be performed by several technicians or on different 
days. Batches of raw materials provide natural effective qualitative blocking 
criteria. 

Whenever the systematic differences are highly recognizable blocking is a 
most effective means of reducing experimental error variation that maintains the 
orthogonality required to avoid confounding the effects of the covariates with the 
treatments. In summary, blocking is recommended as a first course to reduce ex- 
perimental error variance with adjustments on additional information if necessary 
with the analysis of covariance to further improve precision. 


Comparative Observational Studies at a Disadvantage 


The analysis of comparative observational studies can benefit equally from an anal- 
ysis that includes covariates to reduce error variance and adjust group means for 
differences in their covariate values. Observational studies suffer from the disad- 
vantage that units cannot be randomized to the defined treatment groups. The possi- 
bility exists for an influence on the response by additional unobserved covariates 
that are associated with the treatment groups, thus introducing an unknown bias 
into the group comparisons. Experimental studies have the advantage that the ef- 
fects of these variables are distributed among the units by randomization and their 
influence is much less likely to be confounded with the effects of the treatments. 


The Dangers of Extrapolation Beyond the Data 


Finally, caution must be used when treatment means are adjusted to a common val- 
ue for the covariate. Even though the regressions are parallel and there is no possi- 
bility that the treatments affect the covariate, the values of the covariate could be 
quite different for the treatment groups. If the covariate values are widely separated 
for the treatment groups, then adjustments would have to be extrapolated to a value 
of the covariate that is not common to either of the groups. An extreme example for 
illustration is a situation where income is used as a covariate for adjustment in com- 
paring a group of corporate executives to entry-level clerical workers. Clearly, 
there would be no overlap of the income levels for the two groups. The adjustment 
would apply to the extrapolated region of an average income not included in either 
group and a comparison would be made between two groups in a nonsense setting. 
Even if the extrapolated adjustments were valid the standard errors of extrapolated 
values would be quite large. 
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EXERCISES FOR CHAPTER 17 


1. An experiment was conducted on the shear strength of spot welds for three types of steel alloy. Six 
welds were made on each of the alloys and the force required to shear the weld was measured. The 
diameter of the weld was measured because it was believed that the strength of the weld was af- 
fected by its diameter. The data are shown in the table where y = weld strength and x = weld 


diameter. 
Alloy y x Alloy y x Alloy y x 
1 SPS 12S 2 S7.5Y 165 3 38@ 15.5 
l 40.5 14.0 2 69552175 3 44.5 16.0 
1 49.0 16.0 = 87.0 19.0 3 53.0 19.0 
1 $1.9 3-135,0 a 92-0)... 49.5 3 535.9 18.0 
l 615 18.0 2 107.0 24.0 3 58.5 19.0 
1 63.0. 195 4 lie: ga 6 60.0 20.5 
a. Use weld diameter as a covariate for weld strength, and write a linear model for the experi- 
ment, identify each of the terms in the model, and state the assumptions for the model. 
b. Conduct the analysis of covariance, and test the significance of the covariate and adjusted 
treatment means. 
c. Compute the adjusted treatment means, their standard errors, and an average standard error of 
the difference between two adjusted means. 
d. Plot the regression line for each alloy showing the observed means and adjusted means for 
each alloy. 
e. Compute the efficiency of the covariance adjustment. 
f. Test the hypothesis of homogeneous regressions for each of the alloys. 
g. Discuss the results of the experiment and the effectiveness of the covariance adjustment. 
h. The significance level of the test for homogeneous regressions was Pr > F’ = .068. When the 


source of variation for Group x Regression is not significant we end up with the sum of 
squares for error in the regular analysis of covariance assuming homogeneous regression 
coefficient. Effectively we are pooling the Group x Regression sum of squares partition with 
the sum of squares for error from the analysis of covariance for the model with separate re- 
gression coefficients for each treatment group. Hendrix et al. (1982) suggested in that case we 
should use a significance level of a = .20 or .25 since it was similar to the problem of testing 
incompletely specified models as discussed by Bozivich, Bancroft, and Hartley (1956). What 
do you think about this strategy? 

Suppose you subscribe to the philosophy of a significance level of a = .20 or .25 when testing 
the null hypothesis of equal regression coefficients for all treatment groups. The null hy- 
pothesis in part (f) is then rejected. Conduct the analysis of covariance with separate regression 
estimates for each treatment group, and compute the estimated regression coefficients and their 
standard errors for each alloy. What is your inference from the study at this point? 


2. A nutrition scientist conducted an experiment to evaluate the effects of four vitamin supplements 
on the weight gain of laboratory animals. The experiment was conducted in a completely 
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randomized design with five separately caged animals for each treatment. The caloric intake will 
differ among animals and influence weight gain so the investigator measured the caloric intake of 
each animal. The data on weight gain (y = grams) and calorie intake (x = calories/10) are shown 
in the table. 


DIES Yo ex igh ys x PITS! YO Mer Sy ihoy 
f= 38° 3D 2 65°"40 ean b skial 2 Beare pee 
1 67 44 fa a a se ae a 0 Oe 
1 78 44 Pe 2 ae a oy ag ee. Bee 
| ee ater - a ae a? 4 42 51 
La ae eae | ae: 3, 46 4 34 43 


a. Determine whether diet influenced calorie intake to the extent that the latter would be invali- 
dated as a covariate. 

b. Use calorie intake as a covariate for weight gain, and write a linear model for the experiment. 
Identify each of the terms in the model, and state the assumptions for the model. 

c. Conduct the analysis of covariance, and test the significance of the covariate and adjusted 
treatment means. 

d. Compute the adjusted treatment means, their standard errors, and an average standard error of 
the difference between two adjusted means. 

e. Plot the regression line for each diet showing the observed means and adjusted means for each 
diet. 

f. Compute the efficiency of the covariance adjustment. 

g. Test the hypothesis of homogeneous regressions for each of the diets. 

h. Discuss the results of the experiment and the effectiveness of the covariance adjustment. 

A plant scientist conducted an experiment to study the effects of drip irrigation water level on 

sweet corn growth, yield, and quality. Three levels of irrigation were used in the experiment (15.8, 

24.0, and 28.5 inches of water applied), and the experiment was arranged in a randomized com- 

plete block design to control for soil variability in the field. One of the response variables measured 

was the weight of culls per plot or the amount of sweet corn in the plot that was unsuitable for mar- 

ket. The number of plants per plot varied and this would affect the crop yield on the plot. Since the 

soil moisture was optimized to establish the crop stand the variation in plants per plot was not af- 

fected by the irrigation levels imposed after the crop was established. The observed yield of culls 

(y = metric tons/hectare) and x = plants in a 40-foot section of row are shown in the table. 


Irrigation Level 


| 2 3 
Block 5 POL OR 7 See 
1 Las 19 54 1.1 43 
3.1 58 13 57 1.8 60 
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a. Use plants-per-40-feet of row as a covariate for yield of culls, and write a linear model for the 
experiment, identify each of the terms in the model, and state the assumptions for the model. 

b. Conduct the analysis of covariance, and test the significance of the covariate and adjusted 
treatment means. 

c. Show the value of the regression coefficient. Compute the adjusted treatment means, their 
standard errors, and an average standard error of the difference between two adjusted means. 
Compute the efficiency of the covariance adjustment. 

e. Discuss the results of the experiment and the effectiveness of the covariance adjustment. 


4. An experiment was conducted in a randomized complete block design to study the effect of natural 
control, Bacillus, and a standard chemical insecticide for control of hornworm infestations on a 
crop plant. The treatments included four sources of Bacillus (Treatments 1-4), a standard chemical 
treatment (Treatment 5), and a control of no treatment (Treatment 6). The treatments were applied 
to plants grown in field plots in the field. The number of hornworms (count) on each plant were 
counted prior to treatment. The number of live hornworms (/ive) were counted 20 hours after appli- 
cation of the treatment. The data for each plot are shown in the table. 


Block 
1 Z 3 4 
Treatment Count Live Count Live Count Live Count Live 
Bacillus 1 15 lF pe, 26 18 21 25 26 
Bacillus 2 19 18 Zz pe: 20 19 19 20 
Bacillus 3 19 19 19 Zt 21 Zo Zo oe 
Bacillus 4 ZZ 14 31 26 17 17 19 19 
Chemical a7 5 a2 6 26 13 18 10 
Control p#) Za 14 19 ee) 26 23 a7 


a. Use the count of hornworms before treatment as a covariate for the number of live hornworms 
20 hours after treatment, and write a linear model for the experiment, identify each of the 
terms in the model, and state the assumptions for the model. 

b. Conduct the analysis of covariance, and test the significance of the covariate and adjusted 
treatment means. 

c. Show the value of the regression coefficient. Compute the adjusted treatment means, their 

standard errors, and an average standard error of the difference between two adjusted means. 

Compute the efficiency of the covariance adjustment. 

Discuss the results of the experiment and the effectiveness of the covariance adjustment. 

f. The response variable is a count measure and likely does not have a normal distribution. Did 
you check the assumptions of homogeneous variance and normal distributions of experimental 
errors for the model? If not, do so now, and if necessary take corrective actions (according to 
discussions in Chapter 4) and repeat parts (b) through (e). 


o 2 


5. The analysis of covariance can be used to estimate a missing value in a blocked design and provide 
an unbiased estimate of the treatment sum of squares to test the hypothesis of no differences among 
the treatment means with a missing value (Coons, 1957). 
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A covariate x is introduced for the missing value. The covariate is assigned the value « = — 1 
for the missing y value and x = 0 for all other values of y that are not missing. A value of y = 0 is 
assigned to the missing value. Compute a regular analysis of covariance with the values assigned to 
y and the covariate x. The estimate of the missing value is the estimate of the regression coefficient 


B= Ezy /Ezz, and the adjusted treatment sum of squares in the analysis of covariance is the correct 
sum of squares to test the null hypothesis of no differences among the treatment means. 

As a demonstration of the technique use the data for the randomized complete block design in 
Exercise 17.3 and ignore the covariate, s = number of plants, for this exercise. Assume the obser- 
vation y = 2.9 on irrigation level 2 in block 3 is missing and assign it the value y = 0. Construct 
the new covariate with x = — 1 for the missing value and x = 0 for all other values of y, and con- 
duct the analysis of covariance. Estimate the missing value, and use the adjusted mean square for 
treatments to test the hypothesis of no difference among the treatment means. 


Describe a study in your own field in which a covariate is used in addition to or in place of a block- 
ing criterion to reduce the error variance and adjust treatment means. Provide justification for the 
use of the covariate values directly in the statistical model in place of blocking on the covariate, 
and justify its use on the basis that it is not affected by the treatments. Use an example from a jour- 
nal article or your own research experiences if possible. 
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TABLES 


Standard normal distribution; entries are P(Z > z) =a 
Student's ¢ distribution; ta, for P(t > tay) =a 
Chi-square distribution; x3, for P(x? > x4,,) =a 
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F Max test; F, Maz for P(F Maz > FyMaz) = a fort 
treatment groups and v degrees of freedom 


F test power curves for fixed effects model analysis of variance 

F test power curves for random effects model analysis of variance 
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Table I Standard normal distribution; entries are P(Z > z) =a 


z .00 O1 .02 .03 04 .05 .06 .O7 .08 .09 


0.00 5000 4960 .4920 4880 .4840 4801 .4761 4721 4681 4641 
0.10 4602 4562 4522 4483 4443 4404 4364 4325 4286 .4247 
0.20 4207 4168 .4129 4090 4052 4013 .3974 3936 .3897 3859 
0.30 5821. 3782 13 3707 - 3069 - 3632. 3594 3557... 3520 243 
0.40 DRUG — SACS 533 fe 38450. 3500 32 = 38 3192. 3156 SEI 
O30 = 3063 . S050 30is 2981. 2946 2912 er) 2843 260 2776 


0.60 2743 2709 278 as 6) 2578 eae 54 242451 
0.70" << gAZO 2388 ase Geel 2296. 2266 ee ee 277 =. 2748 
O80 2119 -2090 | 2061 2033 2005 1977 1989" AQ22 ~.1894 1867 
0.90: 164] es ee 62 ©1736 711 G83 66D. «G35 =. 1611 
POU ASS? ASG2 2 iaee at 292. 409 1496 ae ee. 1379 


LIK; A357 2.3935 Sa ee ee ADS: SS ae ee 
20: -. DIST So Reb PL eas 10752 11056: 1038 = see8 «41003. 0985 
130 .0968 .0951 .0934 0918 0901 0885 0869 0853 .0838 .0823 
140 .0808 .0793 .0778 .0764 .0749 .0735 .0721 .0708 .0694 .0681 
150 .0668 .0655 .0643 .0630 .0618 .0606 .0594 0582 .0571 .0559 


160 0548 .0537  .0526) .0516 = .0505 0495 .0485 .0475 .0465 .0455 
1.70 .0446 .0436 .0427 0418 .0409 0401 .0392 .0384 .0375 .0367 
1.80 0359 0331 ..0344 0336 0329 0322 ...0314.., O307. 0301 62904 
1.90 0287 “0281 -~ 0274 © 02686" “0262 0256 .0250 —.0244 = —.0239 ~—_..0233 
2.00 0228 0222 0217: Min S2e/7 «6202. 097. - O82 ee 83 


2iAQ~ OL O14 0170 0166  Ole@2 8158 . 0154 — 0150. 0iae 0143 
2:20; O19. VI0 . 0132 . 01 igo 0i22:. OLS Ae. is O96 
2.30 0107 .0104 .0102 .0099 .0096 0094 .0091 .0089 .0087 .0084 
2.40 .0082 .0080 .0078 .0075 .0073 .0071 .0069 .0068 .0066 .0064 
2.50 .0062 .0060 .0059 .0057 .0055 .0054 .0052 .0051 .0049 .0048 


2.60 .0047 .0045 .0044 .0043 .0041 .0040 .0039 .0038 .0037 .0036 
2.70 . USS S034 0033. 0032 003) .0030 0029° 0028 027 0026 
2.80= — 70026: . "0023: = 20024 “ UU23S- 025 0027-0021 0021 = 0020. 0019 
2.90 .0019.:. 0018 — 0018. .0017:+ 0016: 0016 ..0015 .0015 .0014 0014 
3:00: 0013: 01S — 0013. - 2Uirz~ re 001) =~ OT 001: 0010 = 0010 


Adapted with permission from CRC Standard Probability and Statistics Tables and Formulae (1991), 
edited by W. H. Beyer, CRC Press, Boca Raton, Florida. 


Table II Student's t distribution; t,,, for P(t > ta») =a 


ae 0.40 
] Pde) 
2 289 
a oe 8 
4 271 
5 .267 
6 265 
7 263 
8 262 
9 261 

10 .260 

11 .260 

12 259 

13 259 

14 258 

15 258 

16 258 

7 aot 

18 257 

19 el 

20 257 

21 oS | 

22 256 

23 256 

24 256 

25 256 

26 256 

Zt 256 

28 256 

29 256 

30 256 

40 255 

60 254 

120 254 
CO 53 


Computed with MINITAB by R. O. Kuehl. 


0.25 


1.000 
.816 
765 
741 
dat 


718 
711 
.706 
.703 
.700 


.697 
695 
.694 
.692 
.691 


.690 
.689 
.688 
.688 
.687 


.686 
.686 
.685 
.685 
.684 


.684 
.684 
.683 
.683 
.683 


.681 
.679 
.677 
.674 


0.10 


3.078 
1.886 
1.638 
$535 
1.476 


1.440 
1.415 
1.397 
1.383 
1.372 


1.363 
15356 
1.350 
1.345 
1.341 


L337 
1.333 
1.330 
1.328 
1325 


1323 
1.321 
E319 
1.318 
1.316 


+315 
1.314 
L3t3 
re! 
1.310 


1.303 
1296 
1.289 
1.282 


0.05 


6.314 
2.920 
7A be fo 
2tae 
2.015 


1.943 
1.895 
1.860 
1.833 
1.812 


1.796 
1 782 
Oe eS 
1.761 
1 253 


1.746 


1.740 
1.734 
129 
1.725 


1.721 
| BS 
1.714 
1.711 
1.708 


1.706 
1.703 
1.701 
1.699 
1.697 


1.684 
1.671 
1.658 
1.645 


0.025 


12.706 
4.303 
3.182 
2.776 
peg 8 


2.447 
2.309 
2.306 
2.002 
2.228 


2.201 
2,179 
2.160 
2.145 
2.131 


2.120 
2.110 
2.101 
2.093 
2.086 


2.080 
2.074 
2.069 
2.064 
2.060 


2.056 
2,052 
2.048 
2.045 
2.042 


2.021 
2.000 
1.980 
1.960 


0.01 


31.821 
6.965 
4.541 
3.747 
2,305 


3.143 
2.998 
2.896 
2.821 
2.764 


2.718 
2.681 
2.650 
2.624 
2.602 


2.583 
2.567 
Pasties 
050 
2.528 


2.518 
2.508 
2.500 
2.492 
2.485 


2.479 
2.473 
2.467 
2.462 
2.457 


2.423 
200 
2.358 
2.326 
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63.657 
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5.841 
4.604 
4.032 


3.707 
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Spe eB. 
3.250 
3.169 


3.106 
3.055 
3.012 
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2.921 
2.898 
2.878 
2.861 
2.845 


2.831 
2.819 
2.807 
Lg ee 8! 
2.787 


2.779 
2.741 
2.763 
2.196 
2.750 


2.704 
2.660 
2.617 
2.576 
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Table VI Multiple comparisons with the best and Dunnett tests; 
dake for P(d = dake v) =a 


a = 0.05 (one-sided) 
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From C. W. Dunnett (1955) “A Multiple Comparison Procedure for Comparing Several Treatments 
with a Control.” Journal of the American Statistical Association 50, 1112-1118. Reprinted with 
permission from Journal of the American Statistical Association. Copyright 1955 by the American 
Statistical Association. All rights reserved. C. W. Dunnett (1964) “New Tables for Multiple 
Comparisons with a Control.” Biometrics 20, 482-491; and additional tables produced by C. W. 
Dunnett in 1980. 
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Table VI Multiple comparisons with the best and Dunnett tests; 
des ky for P(d Ps eh) oy #5 


a = 0.01 (one-sided) 
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Table VI Multiple comparisons with the best and Dunnett tests; 
do,kv for P(|d| > da,kv) = a 
a = 0.01 (two-sided) 
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APPENDIX 603 


Table VIII = F'Maz test; Fy Maz for P(F Maz > F,Mazx) =a fort 
treatment groups and v degrees of freedom 


a = 0.05 
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Adapted from Biometrika Tables for Statisticians, Vol. 1, 1966, edited by E. S. Pearson and H. O. Hartley, 
with permission of the Biometrica Trustees. 


Table IX F test power curves for fixed effects model analysis of variance 
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Table IX F test power curves for fixed effects model analysis of variance 
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Table IX F test power curves for fixed effects model analysis of variance 
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Table IX F’ test power curves for fixed effects model analysis of variance 
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Table [IX F test power curves for fixed effects model analysis of variance 
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624 APPENDIX 


Table XII 10,000 random digits 

40092 81747 62038 32614 91062 95535 85635 46183 22020 86502 
04040 77141 68126 39738 00241 90805 90476 31866 43819 72869 
27744 59344 15764 27705 86115 10818 33029 47185 78385 47791 
03012 35878 31415 02956 73710 73199 71014 68069 42556 25033 
03756 14835 94850 14777 02637 98934 18678 52385 53701 97816 
32815 29692 41490 78596 23967 53733 32292 33595 89471 51662 
13702 22046 10820 74519 09094 42165 96372 43048 82936 34352 
90988 43962 72505 00500 42042 73894 50219 71653 90329 61575 
62355 05504 69481 95689 06051 27611 72996 91049 80893 71328 
49076 28508 TIS92 13453 51243 18476 54021 05696 11779 15863 
36273 09222 95599 32931 21871 98668 99815 94235 42693 51724 
68993 03491 80984 48526 90843 29395 51200 97425 39987 36822 
39920 31267 60848 © 05034 03234 64340 09434 53143 05462 34473 
54411 06333 36913 08937 96649 36916 29084 26470 41677 97141 
36589 59006 78749 47682 81371 87395 46397 88578 53407 45513. 
50992 10021 61655 46972 77136 72126 (EES 54573 21754 34005 
89336 50439 71992 55854 86657 02737 25120 67498 40731 63664 
54553 95650 25878 12743 41984 83063 34507 74733 54853 04773 
87225 55691 25090 31227 86467 65431 19301 21752 57547 99668 
89701 46487 48959 69554 59786 04082 30302 18492 82721 90393 
04307 47780 07824 26546 52978 22002 95034 57905 33641 63382 
73569 68313 08308 32795 27608 20956 92853 60375 10364 56966 
14979 67281 49874 26421 25443 48250 15627 85412 67329 71950 
63914 98384 78123 83289 46311 38100 89622 74531 91274 67461 
86390 62009 13480 13275 85547 67966 55299 24147 21527 38419 
32007 57426 35585 85429 18445 49946 93822 81152 08889 94365 
01836 33548 68505 64965 21008 99708 75050 86951 00673 13933 
32452 64986 81436 50212 35425 13971 11767 48191 08463 83295 
91424 82977 CVARE) 80372 33184 56791 72463 94831 11268 26694 
38244 80884 35719 07613 73294 39929 89473 92680 85910 06938 
26010 21397 37765 63680 04611 37636 92158 23594 14873 09599 
67054 42605 18159 00922 60210 80042 76776 48738 64415 99442 
83513 91260 18937 89770 87417 66732 54208 99378 77020 84131 
65512 49102 25594 51608 64913 61490 73883 29741 66253 68670 
77810 94342 98901 31379 05225 GI931 51528 10666 37760 89043 
07968 43404 73783 69894 01357 34726 89661 89342 30408 82356 
85370 05580 Pek Pe ae 86903 31950 | 82648 57589 58573 58332 55619 
60923 13031 43571 62495 96389 46218 61748 42218 95879 97246 
85495 42421 04882 67652 717291 12909 73833 45871 71388 82924 
06328 30206 35409 78693 67795 64053 64309 51064 02516 27591 
76447 35060 98444 14312 04058 85851 61351 22310 36618 90115 
78866 29368 84660 13189 86648 88606 30739 67032 70300 45249 
02448 59524 35486 74847 64272 26363 89711 29296 66690 35704 
90016 85192 20040 97348 38337 39697 74895 21585 09245 63855 
30806 23228 45362 65937 86967 98883 50230 70750 34638 60422 
61224 84197 05618 34422 46881 35190 65108 83574 91763 74682 
69058 80489 83177 79446 28867 06358 21001 13830 51949 33314 
29731 88669 90396 86789 06896 21048 21903 65161 69464 18954 
55831 36845 20465 09528 42037 BM es, 15875 83455 15612 09552 
54654 11577 44676 07215 37288 90974 48352 07127 62359 36357 


Computed by R. D. Axelson and R. O. Kuehl. 


APPENDIX 625 


Table XII 10,000 random digits 

26862 23753 16576 48185 26188 37728 25811 97801 49522 37980 
88683 98306 07900 23707 99948 33025 74273 47346 28068 67382 
11649 19469 97566 73268 84821 17286 78963 69433 27656 20853 
11539 08341 75699 97873 28867 21957 62406 20655 43287 20888 
56151 48566 02341 00975 07214 50747 88664 98610 Zisi3 64162 
69530 17535 27838 40610 50380 78140 07497 36527 50167 46282 
15818 76764 92511 03502 03169 06702 59346 35361 77128 18245 
50336 94817 84848 65960 76977 95100 63306 19380 46687 S115 
28406 21331 85488 88274 48787 44075 86769 09610 02927 33135 
72796 02583 10970 77788 83695 69263 20800 32501 91294 98923 
19569 04841 41380 89265 01799 11059 93494 66138 13304 12325 
75557 66470 34334 82621 74506 43534 26514 11624 42412 76386 
11018 38364 01933 32767 56338 85843 84565 62142 29687 17325 
91226 45783 50106 60482 03949 91614 06628 65204 65570 32278 
14651 00799 86285 40491 42792 51441 91259 33280 21469 14226 
45076 54205 08493 36125 46555 08566 12084 62827 00321 56568 
26297 93649 18706 64045 25840 79460 46865 92165 93820 34790 
92147 37854 71738 19132 03564 68174 93762 37616 05231 55992 
78686 88798 72430 69712 59483 93455 12339 77474 55033 80771 
39337 46482 68067 62770 22840 32935 56812 14135 19428 19452 
22252 15585 55870 $2512 14044 26189 89418 52296 38387 75269 
57240 75441 73297 63779 59276 17017 $5257 30527 iziz2 35094 
04322 54208 11657 11593 70223 60145 99744 91446 70523 31993 
27666 19664 78042 51445 47023 30784 69771 82770 55057 52078 
53574 41181 17109 61114 83876 95597 41259 37891 84897 89452 
57403 53175 09644 44756 43387 57407 48145 42504 63603 37123 
47663 74298 42107 67427 31764 81212 60736 57428 14755 42171 
23443 29258 09203 79058 55323 12976 32399 17237 85784 02217 
51643 09345 84572 28560 87805 02876 45418 73275 39984 96349 
80051 75581 Zaizt 58813 75587 23116 83055 86758 00086 80536 
43926 60090 60696 72918 51179 49157 07915 65483 78143 10451 
10254 05670 92734 24229 38248 84834 02502 09713 34985 31628 
59972 80565 94063 83430 54499 83544 62526 58198 51422 43211 
60281 89538 02140 93302 13432 15221 69413 13619 12440 49242 
16811 18472 29709 39684 72700 S221 03202 66143 91079 90161 
01902 59532 16742 87759 74717 38255 98729 42024 47121 20304 
41733 54639 64241 70303 57220 78894 00192 03386 38487 27908 
43355 69142 53361 09024 11162 46052 93213 17886 91801 98031 
70729 22517 60942 41250 37148 24255 03358 38681 54169 01229 
99082 83541 65511 79195 35411 30115 34826 92891 16193 49832 
06263 94134 73079 02018 93129 25999 62186 62716 39844 71598 
25600 31706 45184 04039 12757 17543 57061 35372 14213 75793 
38681 10679 56604 69704 00663 87405 30603 90223 22174 06601 
80481 16829 91125 21262 03530 61672 46950 15458 10962 44808 
47010 50624 57003 15278 31486 44583 73363 00197 45102 14852 
77358 22444 92701 91049 78909 10623 57688 61120 83266 83874 
38510 77101 96660 96889 78941 07529 22497 96254 04580 48105 
81400 38106 00865 27113 91974 34324 39319 71900 27482 66416 
99793 34087 52787 02700 29651 15165 64316 69420 08167 94154 
55128 29344 80171 25092 80960 14721 59328 06345 91581 82399 


626 APPENDIX 


Table XII 


27716 
88339 
37968 
oa 
70360 


00645 
89194 
55980 
13171 
86112 


43520 
38819 
33517 
83251 
41449 


84803 
84633 
01638 
28674 
§3293 


68589 
45432 
11054 
42514 
07047 


17561 
05911 
17612 
06111 
74570 


44776 
83366 
01073 
42767 
13285 


76273 
24056 
49676 
64991 
28089 


96534 
50836 
28746 
80584 
32514 


15164 
39324 
83484 
42484 
65911 


10,000 random digits 
39430 74892 55062 
55349 44443 05790 
25059 37540 04687 
55975 42519 36245 
58659 09025 65477 
21621 36761 53781 
42646 63316 59445 
75632 55491 14356 
81146 28515 59825 
27709 22965 16439 

— 50388 33109 80118 
71158 07453 94947 
96172 08551 91814 
07374 39275 31071 
51749 75858 33478 
30272 85826 18935 
75447 06239 52081 
69018 93805 73036 
59616 07748 09703 
38075 26568 69884 
63091 88890 38333 
44958 68132 99054 
89885 77436 86004 
56335 12990 36130 
90954 35462 86360 
05233 52549 90437 
53885 56556 40850 
20461 62032 47587 
59316 99244 53028 
09252 44243 51742 
03402 35319 10026 
42305 34398 66549 
59291 56232 91521 
58685 31976 44953 
58416 71593 12007 
17196 84114 42240 
80981 20168 29047 
14882 65467 73855 
57009 77897 83153 
98413 31829 62562 
97921 13604 70506 
96073 16019 A8998 
47219 30895 23263 
53948 44644 76912 
27970 73626 19182 
71201 39626 58961 
98893 22000 96405 
60576 62727 96233 
28641 47335 80096 

27235 53577 04477 


tO! 
77996 
53545 
01377 
56751 


73718 
18655 
56454 
88041 
21368 


52631 
52652 
11724 
41215 
82088 


24913 
94605 
07877 
49503 
37289 


#1939 
63817 
68590 
46050 
24284 


92488 
14137 
17035 
21979 
74563 


27044 
99205 
98063 
26620 
65202 


14314 
20856 
03075 
95013 
48896 


29624 
21487 
60178 
98636 
31168 


33178 
15322 
01802 
84105 
95852 


37327 
60410 
26673 
03940 
44808 


58716 
46159 
00347 
74881 
60597 


28818 
20698 
65092 
42198 
93677 


79617 
29414 
41875 
74360 
14617 


37194 
30182 
73162 
93096 
09102 


57833 
97683 
83371 
10783 
07363 


19091 
74699 
96809 
49569 
61687 


49976 
65202 
61191 
86277 
20935 


44005 
50844 
50722 
95840 
63792 


38641 
77189 
98190 
19111 
70219 


49419 
41369 
40810 
29064 
61307 


81236 
16175 
62540 
65046 
39390 


77139 
05598 
65214 
15736 
$2579 


72859 
90088 
73920 
14030 
18927 


21497 
73247 
92891 
63546 
25426 


83542 
84795 
94995 
44488 
56710 


51864 
57967 
25210 
85394 
48650 


68714 
16533 
64955 
70494 
94369 


92916 
90073 
15734 
78725 
07136 


85647 
92508 
76160 
74979 
82456 


30022 
22078 
48176 
34000 
88623 


17964 
17398 
86968 
53098 
78060 


37368 
87513 
17879 
20401 
56769 


79433 
86603 
87707 
35940 
93178 


E3995 
57265 
11720 
78406 
15567 


05540 
51261 
50315 
22348 
157/35 


70008 
96962 
96350 
13279 
27987 


33538 
28520 
48555 
18046 
48925 


41939 
76007 
60946 


77378 | 


97354 


45973 
53154 
59963 
42318 
19107 


98867 
31121 
06334 
65999 
89037 


70404 
72326 
68038 
59673 
05766 


90715 
45302 
79958 
62220 
13978 


94633 
01489 
71362 
10022 
30591 


35084 
53261 
71241 
62320 
25015 


91798 
66590 
41561 
98002 
73850 


56725 
17378 
46086 
74548 
23498 


06533 
11420 
91374 
56932 
OTTT3 


35505 
87108 
89808 
30973 
59750 


19310 
47615 
84478 
84907 
99280 


91330 
13179 
98994 
20130 
68398 


57782 
11634 
58485 
49321 
87653 


76626 
94056 
06629 
15274 
P7131 


40520 
17407 
30956 
42009 
23286 


15634 
25571 
89156 
51346 
88667 


98934 
77263 
21500 
18472 
23924 


34529 


— 43224 


22857 
74456 
72343 


87191 
96331 
80971 
20434 
37099 


90644 
54137 
82700 
15414 
85256 


84457 
14436 
45070 
52870 
08322 


Table XII 


94221 
84431 
05455 
70079 
11446 


45708 
79659 
29884 
43279 
76413 


22492 
58291 
27126 
00842 
73267 


00964 
19944 
70288 
79468 
46373 


61516 
41099 
11024 
70868 
87057 


22388 
46942 
29806 
14617 
25809 


34780 
94677 
12250 
15253 
74787 


537379 
25663 
88330 
27651 
44555 


43299 
08740 
01029 
77885 
09852 


00298 
22831 
43584 
69116 
49035 


10,000 random digits 
07150 80463 03658 
13952 08149 80802 
88879 13449 32774 
21122 27004 58731 
23521 48438 72828 
85141 04684 34025 

33506 87595 03537 
97311 35812 45807 
67351 27095 17012 
06486 82936 33337 
13665 36925 48848 
55383 67639 66799 
62422 62990 56929 
24512 32501 WsTi2 
31540 46725 32423 
87753 51908 44409 
78118 79260 23252 
68077 60880 23921 
67331 29400 11097 
06322 61880 71922 
21667 78846 71110 
45697 50877 86382 
75195 61800 59089 
80366 38791 71706 
86347 | 29685 79945 
64789 68609 63841 
66348 10183 50169 
89043 20745 66576 
79295 99744 62723 
85025 49691 68786 
60969 42518 07858 
73718 64271 59301 
10164 28403 22306 
78715 01384 02794 
90110 69156 62945 
47097 40942 61028 
48264 05635 98825 
25095 40796 63642 
22887 84669 92353 
85772 01754 36006 
87460 52469 24806 
06533 80838 38622 
77011 70206 54340 
85449 53717 52475 
08312 18396 34775 
02548 91069 14424 
43108 29166 08517 
61723 29354 96769 
44955 37-85 63158 
19029 73264 77376 


07013 
09136 
53964 
21160 
21550 


71208 
59265 
W277 
94794 
78926 


50459 
62393 
84904 
28232 
36226 


93930 
34690 
11896 
23247 
24235 


41710 
62166 
42760 
26600 
79152 


44728 
19373 
16824 
68156 
43859 


17204 
31606 
32881 
91983 
69569 


56704 
24427 
62300 
24804 
91187 


79248 
15149 
62941 
89906 
60762 


10383 
07100 
77087 
44994 
96988 


Be 
71238 
18829 
80238 
35582 


19992 
92924 
94148 
30437 
25558 


49600 
04016 
19431 
10170 
89466 


08466 
76692 
20005 
71989 
47410 


64814 
07811 
38788 
01965 
46353 


45553 
54011 
75842 
47838 
50193 


98628 
66479 
58027 
00765 
13906 


29036 
77318 
23016 
40367 
72110 


53238 
08813 
29887 


40846: 


32742 


88365 
78997 
33578 
41991 
72124 


56464 
55048 
21872 
68780 
32768 


51615 
83027 
67863 
68917 
33856 


52743 
17872 
98746 
72564 
48202 


98989 
19405 
08466 
00401 
66986 


57890 
OF21i 
66395 
70249 
73810 


38269 
74267 
76690 
17085 
72474 


46176 
97437 
08882 
88148 
55002 


32246 
19674 
32139 
78290 
23796 


39640 
83839 
87785 
13143 
97343 


2e79) 
90983 
93149 
43607 
70367 


04052 
09919 
74294 
82384 
69773 


71231 
41286 
24496 
13450 
14237 


20066 
55180 
55471 
21429 
25074 


17080 
siz 
86560 
05865 
26995 


26749 
05631 
47948 
43459 
50543 


97970 
03039 
65355 
81836 
26608 


04768 
87625 
01821 
56337 
99567 


57688 
44417 
43912 
45034 
63114 


49745 
96731 
78345 
50758 
75834 


58311 
13085 
23121 
82729 


68128. 


APPENDIX 627 


26053 
40389 
85282 
20602 
38451 


88276 
01315 
87510 
03254 
17551 


76869 
62652 
68157 
20642 
75094 


34531 
51394 
31270 
67119 
12239 


11543 
26154 
35330 
66862 
60423 


56182 
40453 
04887 
43124 
21241 


96909 
69066 
67718 
31053 
49338 


76317 
51022 
14635 
17498 
12759 


48627 
26326 
90186 
29288 
22023 


75041 
19481 
12969 
72299 
89698 


81893 
61875 
92231 
92898 
79113 


58194 
14180 
42172 
05915 
59814 


79832 
10594 
65921 
27757 
60558 


77828 
48362 
69837 
80106 
86031 


88532 
32851 
74353 
15575 
98381 


01730 
44193 
68928 
47199 
66151 


23402 
53509 
74001 
17782 
95426 


67112 
69841 
06138 
93457 
12726 


56065 
13098 
65065 
ef93S9 
87492 


45545 
86602 
04072 
06758 
06584 


628 APPENDIX 


Table XIII 400 random permutations of first 10 integers 


os 410-3 


i 


10 


10 


10 


10 


10 


co ~~ OO 


10 


10 


10 


10 


10 


10 
7 
4 


3 
0 
2 


3 
10 
8 


4 


10 


co 


10 


10 


10 


10 


co 


10 


10 
5 


7 


10 


3 


10 


4 


10 
1 


10 
1 


10 


10 


10 


10 


10 
3 


4 
3 


4 
6 
2 
10 


10 
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Table XIII 400 random permutations of first 10 integers 
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Answers to Selected Exercises 


Chapter 2 
| oe Yij = Pit C3; ae Be SEE ee Ae eee, 
4; = signal type mean, e;; = experimental error 

b. e;; random, independent with mean 0 and variance o? the same for all signals 

c Source df SS MS 
Signal 2 ~. 1,202.63 601.31 
Error 12 137.83 11.49 
Total 14 =: 1,340.46 


d. 35.5,20.9,14.0 standard error = 1.52 
e. (32.2, 38.8), (17.6, 24.2), (10.7, 17.3) 

f “Fo = 324, FOS, 2, 12) = 9-89, Resect 
g. Spy = 177.4, Spa = 104.5, 53 = 70.0 


eee Yij = Wit ei; ots WO Pa ee gs S| 
ji, = treatment mean, e;; = experimental error 
e;; random, independent with mean 0 and variance o? for all treatments 


Cc. Source df SS MS 
Treatment 4 48,569 12,142 
Error 20 3,110 156 
Total 24 51,679 


86.37, 112.85, 206.16, 110.69, 87.97 standard error = 5.59 

(74.71, 98.03), (101.19, 124.51), (194.50, 217.82), (99.03, 122.35), (76.31, 99.63) 
Fo = 78.08, F'(.05, 4, 20) = 2.87, Reject 

Spy = 431.87, Sy = 564.26, 5u3 = 1030.78, 54 = 553.47, Sus = 439.86 


mame e 


anes Yij = Pi t+ Ci; eo ee Bee Ck eee sk ees ee 
4, = treatment mean, e;; = experimental error 
e;; random, independent with mean 0 and variance a? for all treatments 


c Source df SS MS 
Treatment 4 0.175969 0.043992 
Error 7 0.000094 0.000013 
Total 11 0.176062 


633 


634 CHAPTER 3 


Chapter 3 


Tmo 


mean(std err), 0.049(0.0018), 0.086(0.0026), 0.116(0.0026), 0.187(0.0026), 


0.397(0.0026) 


(0.045, 0.053), (0.080, 0.092), (0.110, 0.122), (0.181, 0.193), (0.391, 0.403) 


Fy = 3284.75, F(.05, 4, 7) = 4.12, Reject 
4p = 0.197, 2p. = 0.171, 23 = 0.231, 24 = 0.374, 2us = 0.794 


Use D = 30,t = 5,07 = 156 in Equation (2.26). If r= 7, ® =2 and power is approximately 
0.93 in Appendix IX for p; = 4 and 1. = 30 


cae oR 


estimate (std err), (i) 18.03(1.86), (ii) 6.90(2.14) 
(i), 1083.603, (ii) 119.025, SST = 1202.628 

t(.025, 12) = 2.179, (i) to = 9.71, Reject, (ii) to = 3.22, Reject 
MSE = 11.49, F(.05, 1, 12) = 4.75, (i) Fo = 94.34, Reject, (11) Fo = 10.36, Reject 


f= PE 


Source df SS 


Nitrogen 4 4,994.8 
Linear a 
Quadratic ie, ad 
Cubic l 67.6 
Quartic 1 185.7 
Error 15. ; 33344) 


MS 


1,248.7 
2,958.4 
1,783.1 
67.6 
185.7 
22:5 


Fo 
5.61 
13.29 
8.01 
0.30 
0.83 


Fy for linear and quadratic exceeds F(.05, 1, 1.5) = 4.54; thus quadratic is the best-fitting 


polynomial. 


Estimated coefficient (std err): y. = 142.60(3.34), a; = 8.60(2.36), ag = — 5.64(1.99), 


a3 = 1.30(2.36), ag = — 0.81(0.89) 
Quadratic regression equation is ¥ = 114.11 + 0.6234.N — 0.0023N? 


Nitrogen Mean Predicted Std Err 
0 112.0 114.1 

50 145.5 139.6 

100 149.0 $53.9 
150 Eis 156.8 
200 149.0 148.5 


Patient 4; max Y; 
j#i 
1 167.0 244.6 
2 185.5 244.6 
3 104.0 244.6 
4 214.9 244.6 
5 149.0 244.6 
6 169.4 244.6 
7 160.5 244.6 
8 244.6 2149 


7.0 
4.5 
32 
4.5 
7.0 


d(.05, 7,8) =2.81 M =2.81,/2(6.02)/2 = 6.89 


95% CI 

(99.2, 129.0) 
(130.0,149.2) 
(142.8,165.0) 
(147.2,166.4) 
(133.6,163.4) 


D;-M D+M 


— 84.5 — 70.7 
— 66.0 = 322 
—147.5 — 133.7 
— 36.6 — 22.8 
— 102.5 — 88.7 
— 82.1 — 68.3 
— 91.0 — 77.2 
22.8 36.6 


95% SCI 
(L,U) 


(— 84.5, 0) 
(— 66.0, 0) 
(— 147.5, 0) 
(— 36.6, 0) 
(— 102.5, 0) 
(— 82.1, 0) 
(— 91.0, 0) 
(0, 36.6) 


Patient 8, with D; + M > 0, is selected as the subject with the highest cholesterol. 


t. 


11. 
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d(.05,3, 16) = 2.59, D(.05,3) = 2.59,/2(6.875)5 = 4.3 


95% SCI 
Treatment Mean 7; —7, (L,U) 
Control 42.2 — _ 
3% Glucose 29.0 —13.2 (-—17.5, — 8.9) 
3% Fructose 27.6 -—146 (-— 18.9, — 10.3) 
3% Sucrose 34.0 —82 (—12.5, —3.9) 


All treatments differ significantly from the control since none of the intervals include 0. The 3% 
fructose is most removed from the control, and the 3% sucrose is least removed from the control 
based on the upper limits. 


F(.05, 212). = 3.89 
Contrast Estimate Std Err S(.05) 


Pretimed vs. others 18.03 1.86 5.19 
Semi vs. Fully 6.90 2.14 3.97 


Both contrasts are significant since the estimates exceed S(.05). 
ee be ee 20 df 

LSD(.05) = 16.45, HSD(5, .05) = 23.60, SNK(2, .05) = 16.45, 
SNK(3, .05) = 19.95, SNK(4, .05) = 22.08, SNK(5, .05) = 23.60 


Significant Pairs by Each Criterion* 


Treatment Mean LSD(.05) HSD(5,.05) SNK(k, .05) 
1. Premolt 86.37 2,3,4 2,3,4 2,3,4 
2. Fasting 112.85 1,3,5 13.5 285 
3.60 gmbran 206.16 1,2,4,5 1,2,4,5 1,2,4,5 
4.80 gmbran 110.69 i ‘3 13S 
5. Laying mash = 87.97 2,3,4 23 2,3,4 


“Each row of numbers identifies those treatments declared significantly 
different from the treatment shown in the first column of that row. 


e. The HSD provides greater strength of inference with confident inequalities than does LSD or 
SNK; they provide tests of homogeneity only with the weak sense experimentwise error rates. 


f. Tukey 95% SCI 
Contrast Estimate (L,U) 
Bran60 — Bran80 95:5 (71.9, 119.0) 
Bran60 — Fasting 93.3 (69.7, 117.0) 
Bran60 — Mash 118.0 (94.6, 142.0) 
Bran60 — Premolt 120.0 (96.2, 143.0) 
Bran80 — Fasting —2.2 (—25.8,21.4) 
Bran80 — Mash 22.1 (— 0.9, 46.3) 
Bran80 — Premolt 24.3 (0.7, 47.9) 
Fasting — Mash 24.9 (1.3, 48.5) 
Fasting — Premolt 26,5 (2.9, 50.1) 
Mash -— Premolt 1.6 ( — 22.0, 25.2) 


g. Width of interval estimate as well as direction and magnitude of separation between two 
means. Clearly, the separation between the Bran60 stage and each of the other stages is much 
greater than the separation between any other pair of stages. 
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Chapter 4 
loa 
8 
1 ” 
o 
2 
3S 
: if 
ae 2 
© J 
3 D 
” o 22! 
id) wo 
o re 
S 
s o 
ry ia] 
eS 
lox 
(dp) 
So 
S 
a = 0 1 2 1000 2000 3000 4000 
Standard normal quantiles Fitted 
Levene(Med) test for homogeneous variances, Fo = 23.42, F(.05, 3,20) = 3.10, Reject 
b. B = 1.73, p=1-— 1.73 = — 0.73, use reciprocal transformation from ladder of powers, 


z=I1/y 
c. Residual plots after reciprocal transformation 


3 ° 
=] 28 
S =e 
3 
” 
3 o 
” 5 2 ee 
@ 8 S 
= 8 
e 2? fe) 
N © = 
°o © 3 
3 2 3c 
So = 
: lox 
”) 
3 
=) ra 
2 ° 
: o 
“2 “1 0 1 2 0.0004 0.0008 0.0012 0.0016 
Standard normal quantiles Fitted 


Levene(Med) test for homogeneous variances, Fy = 1.56, F(.05, 3,20) = 3.10, Do not reject 
d. Conduct analysis on coded value z = 1000 x (1/y) 


Source df SS MS Fo 


Linear l as 47d - S702 
Quadratic l UG7 0635 . 11.55 
Cubic 1 0.04 0.04 0.69 
Error 20 Loy - 0.05 


Quadratic polynomial is significant, F'(.05, 1,20) = 4.35 


< a. Binomial 
. sin-!/7, where 7 = (y + 3/8)/(n + 3/4) 
Cc. Source df SS MS Fo 
Clones 8 0.05 0.01 1.00 
Error 9 005. . G01 


Chapter 5 


5. 


a. 


sia 
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Means: 0.47, 0.45, 0.40, 0.32, 0.49, 0.44, 0.46, 0.40, 0.48 

d(.05, 8,9) =2.81 M = 2.81,/2(0.01)/2 = 0.281 

All clones are selected, indicating there are no differences between the highest rooting clones 
and the others. 


f-values are 0.0333, 0.1000, 0.1667, 0.233, 0.3000, 0.3667, 0.4333, 0.5000, 0.5667, 0.6333, 
0.7000, 0.7667, 0.8333, 0.9000, 0.9667 

q(f;) are — 1.83, — 1.28, — 0.97, — 0.73, — 0.52, — 0.34, — 0.17, 0.00, 0.17, 0.34, 0.52, 
0.73, 097, 1.28, 1.85 


16° 20 (22 24 we Ze 


16 


14 


-1 0 1 


Standard normal quantiles 


Yj = Wat Ci; t= 1,2;2..,5 g= 1,2,..48 
a; = random effects of sires with mean 0 and variance oa? 
e;; = random effects of progeny with mean 0 and variance a? 


Source df SS MS ___E(MS) 
Sires 4 5,591.1 1,397.8 o2+802 
Progeny 35 16,232.7 463.8 o2 


= 463.8, ¢2 = 116.7 

2 90% CI: (326, 722), @2 90% CI: (— 4, 1387) 
Fy = 3.01, F(.05, 4, 35) = 2.65, Reject 

DB, = 0.20, 90% CI: (.02, 0.67) 


Yij = UT A; + Ci; i= 1,2,...,8 j= 1,2,3,4 
a; = random effects of cottonseed lots with mean 0 and variance o2 


e;; = random effects of seed samples within lots with mean 0 and variance o? 


é 
Source df SS MS E(MS) 

Lots 7 13,696.5 1,956.6 o2+402 

Samples 24 $548.2 2312 @ 
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bo S23, A 

c. 662.6 

d. 34.9% to samples within lots and 65.1% to lots 
2 25.7 

f. 


See Section 5.8 or Section 5.10 for applications. Either determines optimum allocations of 
samples based on cost or number of samples per lot to achieve a test of hypothesis power. 


6. a. yf =e Reg dy CTH 12 = 23 FS 12,3 
7; = fixed effect of manufacturers 
e;; = random effects of filters with mean 0 and variance o? 
d,;, = random effects of filter tests with mean 0 and variance a7 


Source df SS MS E(MS) 
Manufacturer 1 1.8689 1.8689 03 +30? +96? 
Filters 4 3.7413 0.9353 62+ 302 

Tests 12 0.3047 0.0254 of 


b. Fo = 20; £05, 1,4) = 7.71, Do not feject 
c. Mean (std err): 0.47(0.32), 1.12(0.32) 
95% CI: ( — 0.42, 1.36), (0.23, 2.01) 
d. c,; = 200,c. = 1, 63 = 0.0254, G2 = 0.3033 
n = S tests, r= 8 Miters 


8. a. Yij = pt at egt dizx ee ee Bee ye nS a ae Se ere 
a; = random field effects with mean 0 and variance o? 
e;; = random section effects with mean 0 and variance o2 
d;j, = random soil sample effects with mean 0 and variance or 


Source df SS MS E(MS) 
Fields 14 $443 103 a7 + 1.190? + 2.380? 
Sections. . 15. 11353. 2077 04 + 1.207 

Samples 6 8.80 147 a3 


. B=147, 6 = = 058, 77 = 011 
ce. M =0.78, v=15, Fo = 0.78/0.77 = 1.01, F(.05, 14, 15) = 2.42, Do not reject 
d. F =0.77/1.47 = 0.52, F(.05, 15, 6) = 3.94, Do not reject 


Chapter 6 
| Foe Yisk = +0; + Bj +(aB)ij + eije fet. 2.3 ge i, 2- k= 1,2,35,4 
a; = fixed effect of alcohol, @; = fixed effect of base 
(aZ),; = interaction effect between alcohol and base 
€ij; = experimental error, mean 0, variance o? 


Source df SS MS 

Alcohol 2 540 2.10 
Base l 6.351" G51 
Interaction Pelee 5 AS gies oe 
Error 18 36.76 2.04 


b. Alcohol marginal means: 89.98, 91.04, 90.10; standard error = 0.50 
Base marginal means: 89.85, 90.89; standard error = 0.41 


a 


Residuals 


i 


a. 
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Alcohol 
Se Soe es Fe 
] 90.83 89.80 88.93 
2 89.13 92.28 91.28 
Cell means standard error = 0.71 


Fo = 5.53, F'(.05, 2, 18) = 3.55. Reject 


Contrast: (Base 1 — Base 2) for each alcohol, s, = 1.01, 
Bonferroni t(.025, 3, 18) = 2.64 


Alcohol c to 
1 1.70 1.68 do not reject 
Z —2.48  —2.46 donot reject 
3 —2.35 —2.33 donot reject 


Levene(Med) test, Fp = 0.51, Do not reject equal treatment variances hypothesis 


° 
© 
<a 


0.90 1.25 


Square root absolute residuals 
0.55 


0.20 


-2 -1 0 1 2 89 90 91 92 


Standard normal quantiles Fitted values 


Yijk = pray pj +(ap)i+ C ijk = 1, 2,3,4 J 1,2,3,4 k= Se: 
a; = fixed effect of fabric, 3; = fixed effect of temperature 

(aG);; = interaction effect between fabric and temperature 

€ijk = experimental error, mean 0, variance ad 


Source df SS MS 

Fabric 3 41.88 13.96 
Temperature 3 283.94 94.65 
T linear l 262.14 262.14 
T quadratic ] 27a 21.98 
Fabric x Temperature 9 15.86 1.76 
F x T linear 3 11.38 be a 
F x T quadratic 3 2.69 0.90 
Error 16 0.80 0.05 


Fo = 35.24, FOS, 9, 16) = 2.54 Reject 

Temperature linear Fy = 5242.9, Temperature quadratic Fp = 435.6, 
F(.05, 1, 16) = 4.49, Reject both null hypotheses 

Fabric x Temperature linear Fp = 75.4 

Fabric x Temperature quadratic Fp = 18.0 


~ F(.05, 3, 16) = 3.24, Reject for Fabric x Temperature linear and reject for 


ric x Temperature quadratic 


Fab- 
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e. 
10 
O Fabric 1 - 
O Fabric 2 re 
4 Fabric 3 e 
oO 4 ; 
Pag © Fabric 4 Ps Pe . 
fey) - 
© 
& 
5 
210 219 220 2209 
Temperature 
a Source df MS Fo 
Insecticide (J) 4 199.84 
Herbicide (A) 4 171.70 
I linear 1 4.96 0.03 
I quadratic 1 soot 318 
I cubic 1 162.90 0.94 
A linear 1 372.91 3.29 
H quadratic 1 108.75 0.63 
H cubic 1 465 0.03 
Tlin x Alin + 3,10249 1733 
Ilin x H quad ] 236.60; 1.36. 
Tlin x Hub 1 6.76 0.04 
I quad x A lin EL 124,180.30... 6.78 
_ I quad x H quad 1 195.00 1.12 
I quad x H cub ] aio 20 
Icub x AH lin ] 379.80 2.18 
I cub x H quad l 23.41 0.13 
Icub x H cub 1 18.49. 0O.11 
Error 15 174.00 


b. Reject null hypothesis if Fp > F(.05, 1, 7.5) = 3.97. 
c. wet = 105 — 1.177— 40.7H + 0.00749/2 + 1.72()(H) — 0.0145(17)(A) 


To A. Yaget = Wt 4 + Bj + (AB)ig + VE (1) ik + (BY) je + (BY) ight © ign 
i.ak, = 12,3 2252 
a; = fixed soil texture effect, 8; = fixed salinity effect, +, = fixed water effect, 
(a3);; = interaction between texture and salinity, (ay); = interaction between texture and 
water, (G)j~ = interaction between salinity and water, (a(@-);;, = three factor interaction, 
€ijk] = experimental error with mean 0 and variance a? 
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Source df SS MS Fo 

Texture (7) 2 1240 6.23 11098 
Salinity (S) a2 Sa 29.07 31777 
Water (W) 2 26.28 13.14 234.09 
rx § 4 1,28: = 0.32 =e 
Tx W 4 304° 0.76 13:52 
Sx W 4 2062. % 3.15 — 9182 
TxSxWw 8 £635. % 0.21 3.68 

7 


Error 2a 1:52-~ 0.06 


b. Soil x Salinity x Water means 


Salinity 2 8 10 

Water 0 5 15 0 _ 15 0 5 15 
teeee 154-1 a Ss = Ft 0.20 
Loam OG. 255. 340 07.029: 056. G4. 020 0.39 
Clay 144 308 356 U62 095 164-7 049: 080 2.41 


Cell means standard error = 0.17 


Texture x Salinity Means Texture x Water Means 
Salinity Water 
Texture 2 8 16 0 5 15 
L-eane. 195° O10 U.iZ 0.23 0.69 1.24 
Loam 2.900.536 027 0.43 0.98 2.12 
Clay ine et a O85. .3.61 523.20 
Cell means standard error = 0.10 Cell means standard error = 0.10 


Salinity x Water Means 


Water 
Salinity 0 Ls 15 
y) 0.98 2.42 4.78 
8 030 045 O42 
16 0.23 0.40 1.00 
Cell means standard error = 0.10 


Texture means: 0.72, 1.18, 1.89, standard error = 0.06 
Salinity means: 2.73, 0.51, 0.54, standard error = 0.06 
Water means: 0.50, 1.09, 2.19, standard error = 0.06 

c. Reject for Texture, Salinity, and Water, F'(.05, 2,27) = 3.35 
Reject for all two factor interactions, F'(.05, 4, 27) = 2.73 
Reject for three factor interaction, F'(.05, 8, 27) = 2.31 
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11. a. 


Source df SS MS Fo 

T 2°254246 6.23 110.98" 
S (lin) 1%°"3855 38.55. 686.70" 
S (quad) 1 19.58 19.58 348.84* 
W (lin) 1 26.28 26.28 468.04* 
W (quad) 1 0.01 0.01 0.13 
T x S(1) 2 0.96 0.48 8.55% 
T x S(q) 2 G32. 0.16 - 2.99 
T x Wh) 2 298 149° 26:57" 
T x Wq) 2 0.05 0.03 0.48 
S(1) x W(1) t TE18 1218 21690" 
S(1) x W(q) 1 613 013 32 
S(q) x W(1) 1 8.30 8.30 147.79* 
S(q) x W(q) 1 002° --0:02— 0:30 
Tx S(1)x WA) 2 1:27 -0.63-<- 11.27" 
Tx S(1)x Wq) 2 O19: 009 1.67 
Tx S(q)x Wi) 2 O18 6.09 138 
T x S(q)x Wq) 2 002-.-0.01-... 0418 


*Reject the null hypothesis if Fy > F(.05, 1,27) = 4.21 
or Fy > F(.05, 2, 27) = 3.35. 


The SS Yates and MS Yates columns are the Weighted Squares of Means calculations, which 
can be produced by programs (MINITAB GLM with “Adj. SS,” SAS GLM with “Type Ill 
SS,” or Splus with the “summary.aov” function). Notice the SS sequential column. The 
Interaction and the Error sums of squares are identical to those for the Yates columns. The 
interaction and error sums of squares partitions are obtained by a sequential fit of terms to the 
model in the order presented in the Source column and may be used equivalently to test 
interaction. Most programs also produce the sequential fit (MINITAB GLM with “Seq SS,” 
SAS GLM with “Type I SS,” and the default for Splus “summary.aov” function). 


Source df SSsequential SS Yates MS Yates 
Base 1 8.897 14.242 14.242 
Alcohol Z 4.105 7.675 3.838 
Interaction 2 29.240 29.240 14.620 
Error 10 19.873 19.873 1.987 


Fy = MS(Interaction)/ MSE = 14.620/1.987 = 7.36, Reject the null hypothesis of no 
interaction since Fy > F(.05,2, 27) = 3.35. Not necessary to test main effects. 

Mean(std. error) 

Base: 89.8(0.49), 91.8(0.61). Alcohol: 90.0(0.64), 92.0(0.81),90.4(0.54) 

Interaction: hj; = observed cell mean and 3, = JV MSE/r;; 

91.1(1.00), 89.3(0.81), 88.9(0.70), 89.0(0.81), 94.7(1.41), 91.8(0.81) 

Alcohol level, difference (std. error), to 

Alcohol 1, 2.02 (1.29), 1.57:Alcohol 2, — 5.43(1.63), —3.33:Alcohol 3, — 2.84(1.08), 
— 2.64 

Use the Bonferroni t for 3 comparisons and 10 degrees of freedom and Reject Ho if 
\to| > cad 

Cell means remain the same. Observed marginal means for Base: 89.5, 91.0 and for Alcohol: 
89.8, 90.6, 91.8. 


Yijk = M+ azp+bj+ (abate t=1,2,3,4 7=1,2,3,4,5 k=1,2 


a; = random run effect, mean 0, variance o? 
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b; = random patient effect, mean 0, variance o? 
(ab);; = random run x patient interaction effect, mean 0, variance a7, 
ijk = experimental error, mean 0, variance og? 


Source df MS E(MS) 
Runs (R) 3 364.46 «7? + 207, + 100? 
Patients(P) 4 13,152.03 0? +202, +802 
Rx P 12 9.12 024203, 

Error 20 6.61 o? 


35.53; 1,642.86; 1.26 

d. Interaction: Ho: o?, = 0, Hy:o2, #0, Fo = 1.38, F(.05, 12, 20) = 2.28, Do not reject 
Patients: Ho: 0? = 0, Ha: 07 #0, Fo = 1,442.11, F(.05, 4, 12) = 3.26, Reject 

Runs: Ho: 0? = 0, Hy: 02 # 0, Fo = 39.96, F(.05, 3, 12) = 3.49, Reject 


a 


‘> 


a. Yijk = + 05 + Diy) + CEG; ee ere 7=1,2,3,4 Ee 
a; = fixed effect of alloys 
b,;) = random effect of castings within alloys, mean 0, variance Osa) 


Ck(ij) = random effect of bars within castings, mean 0, variance oe b) 


b. 
Source df MS E(MS) 
Alloys 2 14.69 024 +207) +80 
Castings (Alloys) 9 0.52 O26) + 205 4) 
Bars (Castings) 12 0.36 O16) 

c. Fo = 28.25, F(.05, 2, 9) = 4.26, Reject 


d. Means (std err): 14.7(0.25), 16.7(0.25), 14.1(0.25) 
95% CI: (14.1, 15.3), (16.1, 17.3), (13.5, 14.7) 
e. Bars: 0.36, Castings: 0.08 


A. Yigkd = t+ 04 +b; + (ab)i5 + C5) + (Ac) inc + eign 
a; = fixed machine effect 
b; = random plot effect, mean 0, variance oF 
(ab);; = random effect for machine by plot interaction, mean 0, variance a, 
Cx(j) = random sample within plot effect, mean 0, variance 218) 
(ac);,(;) = random effect for machine by sample interaction nested within plot, mean 0, 
variance Oo o(b) 


€;jk1 = random error, mean 0, variance o? 


Source df MsS* E(MS) 

Machine (M) 1 25.29 6° + 2074, +402, +1262 

Plot (P) 2.52482 o° + 207216) aa 40°) + 402, + 80? 
M xP 2 0.10 07 +2074) + 402, 

Sample (S) withinP 3 8.86 0?+4202,,. +402, 

M x S within P 3 0.54. a2 + 207.51) 

Error 122 O47 =o 


“Mean square value x 102 


c. Fy = MSM/MS(MP) = 252.9, F(.05, 1, 2) = 18.51, Reject 
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i 


Yij = METH Pj t Cig 


Py 6 dteek: 


ey oe ig 


7; = fixed irrigation method effect, p; = fixed block effect 
€;j = experimental error, mean 0, variance o? 


Experimental errors are random and independent with mean 0 and variance o 


Source df SS MS 

Blocks 7 “SD 7-30T 65,358 
Irrigation 5 47,842 9,568 
Error 35 16,888 4,197 


2 


Treatment 


and block effects are additive. Randomization justifies assumption of random, independent 
errors. The relationship between treatment effects is consistent among the blocks. 


2295324 


d(.05, 5,35) =2.64 D(5, .05) = 85.5 


Irrigation Mean y;—¥Y, 
Flood 229.6 control 
Trickle 299.6 70.0 
Basin 290.4 60.8 
Spray 223.8 —5.8 
Sprinkler 292.0 62.4 
Sprinklertspray 291.0 61.4 


No treatment is significantly different from flood, since all 


intervals include 0. 


95% SCI 
(L,U) 
(— 15,5,455.5) 
(— 24.7, 146.3) 
(— 91.3, 79.7) 
( — 23.1, 147.9) 
(— 24.1, 146.9) 


RE = 0.99(3.17) = 3.14. More than three times as many replications required by the 


completely randomized design to have same variance of treatment mean. 


Yigj = MOTH + Pj t i; 


7; = fixed fertilizer effect, p; = fixed block effect 
e;; = experimental error, mean 0, variance co? 


Source df SS MS 


Blocks 4°2-4,95 1.24 
Fertilizer 4 72.12 18.03 
Error 16. 12-76 “4:80 
Contrast 


= 
Nene vs; others(l = 47" 7, — 3, 


Nitrogen(0, —1, +1, —1, +1) 
P,0; (0, — 1, —1, +1, +1) 
Interaction(0, +1, —1, —1, +1) 


“Reject null hypothesis if Fy > F(.05, 1, 16) = 4.49 


0.45, 0.80, 0.80, 0.80 
RE = 0.98(1.09) = 1.07 


Yig =U Pit Vet THT Ci; 


7, = fixed construction method effect 
e,; = experimental error, mean 0, variance a? 


i,j,k = 1,2,3,4 
pi = fixed time period (row) effect, 7; = fixed technician (column) effect 


Pee 5 = 12S 


SS 
I 


2a 


4.32 
64.44 
0.22 


Fo 
3.93 


5.42* 
sory 
0.28 


10. 
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Source df SS MS 
Time 3 467.19 155.73 
Technician 3 1749 5.73 
Method : 145.69 48.56 
Error 6 22.88 3.81 
b. 0.98, 1.38 
ce. @(.05, 4, 6) = 4.90, HS D(4,.05) = 4.80, 95% SCI are y; — y; + 4.80 
95% SCI for differences that do not include 0 are (A, D), (A, C) 
d i =299Ge 102 
a. Yih = UW+TEt Pj t+ (TP)ig + Cizx ee aa ae eae 
7; = fixed nitrogen rate effect, p; = fixed block effect 
(Tp)i; = fixed rate x block interaction effect 
€ijk = experimental error, mean 0, variance a? 
The interaction effect can be included in the model and its sum of squares partition, computed 
by virtue of the fact that two plots for each treatment were included in each block. Thus, 
experimental error variance can be estimated separately as the variance among plots treated 
alike within each block. 
Source df SS MS 
Blocks 1 1022.45 1022.45 
Rates 4 4813.00 1203.25 
Rate linear i 2856.10. 2356,10 
Rate quadratic l 1716.07 1716.07 
Rate cubic 1 60.03 60.03 
Block x Rate 4 287.80 71.95 
Error 10 422.50 42.25 
b. Fo = 1.70, F(.05, 4, 10) = 3.48, Do not reject 
c. See SS partitions in analysis of variance table. 
Linear: Fp = 67.6, Quadratic: Fp = 40.62, F(.05, 1, 10) = 4.96 
Reject null hypothesis for linear and quadratic effects 
d. Fo = 1.42, Do not reject null hypothesis; cubic not significant 
a. 
Source df SS MS 
Total 45 614,708 
Blocks (unadj.) 7 432,384 61,769 
Irrigation (adj.) 5 51923 ..10,385 
Error 33 130,402 3,952 
bh. Fo = 2:63; PCOS, 5; 33) =" 2-50; Reject 
c. Mean(std err): 290.1(24.1), 290.4(22.2), 223.8(22.2), 292.0(22.2), 291.0(22.2), 213.7(24.1) 
d. Larger than those treatments without lost plots 
e. (1) 34.1, (ii) 32.8, (iil) 32.8, (iv) 31.4 
f. Standard errors increased if either or both treatments had missing plots; decreased power 
a. Mean Square* 


Year | Year 2 
Source df Spring Winter Spring Winter 
Treatment 3 42.95 +. 81.05 193.15 48.65 
Error 4 12.99 18.11 12.19 45.31 


* 
Mean square value x 10° 
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b. F max = 45.31/12.19 = 3.72, Fos max = 20.6, =4,yv =4 
Do not reject hypothesis of equal variances 
c. Source df MS E(MS) 


Year (Y) l 13.61 0? +20}, + 407, + 804, + 1604 
Season (S) 1 1,911.01 0? +202, + 802, + 1602 
Treatment (7) 3 257.70 oa? + 207,, +40}, + 867 

YxsS l 94.61 07? +20}, + 80), 

TxS 3 15.94 02 +202, + 463, 

Tx Y 3 28.47 0? + 207, + 403, 

FTXKTXS 3 13.27 o? +203, 

Error 16 2 a 


* 
Mean square value x 10° 


d. Season: Fp = 1911.01/94.61 = 20.2, F(.05, 1, 1) = 161.4, Do not reject 
Treatments: Fo = 257.7/28.47 = 9.05, F(.05, 3, 3) = 9.28, Do not reject 
Y x S: Fo = 94.61/13.27 = 7.13, F(.05, 1, 3) = 10.13, Do not reject 
T x S: Fy = 15.94/13.27 = 1.20, F(.05, 3, 3) = 9.28, Do not reject 
Tx ¥: Fy = 28:47/13.27 = 2.15, FCO, 3, 3) = 9.28, Do notireject 
Tx YX S: Fo = 13.27/22.15 = 0.60, F(.05, 3, 16) = 3.24, Do not reject 


Chapter 9 
ioe? eee 
E=; 


Cc. Yij = MTGE Pit Ci; i= 1,2,3,4 j = 1,2,3,4,5,6 
7; = fixed temperature effect, p; = fixed block effect 
€;j = experimental error, mean 0, variance a” 


Source df SS MS 

Blocks (unadj.) 5 613.66 122.73 
Temperature (adj.) 3 718.29 239.43 
Temp linear ] 677.08 677.08 
Temp quadratic 1 133 133 
Temp cubic ] 33.87 3387 
Error 3 41.00 13.67 


Test for temperature: Fy = 17.51, F'(.05, 3, 3) = 9.28. Reject 
means: 26.46, 23.82, 10.07, 3.61; standard error = 2.50 
3.70 
f. See analysis of variance table for SS partitions 
Linear: Fo = 49.53, F(.05, 1, 3) = 10.13, Reject 
Quadratic: Fy = 0.54, F(.05, 1, 3) = 10.13, Do not reject 
g. No. Fo for cubic = 2.48, F(.05, 1, 3) = 10.13, Do not reject 


eo &. 


ee Yijm = M+ t+ ppt ¥mt Cigm toamet 152,24 bf ay =1,2,3,4 
7; = fixed intersection type effect, p; = fixed observer effect 
Ym = fixed city effect 
€ijm = experimental error, mean 0, variance o? 
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b. 4.19 


CHAPTER 10 

Source df SS MS 

Observer 3 146.51 48.84 

City (unadj.) 6... 1,342.07 233.68 

Intersection (adj.) 6. 9,972.46 1,662.08 

Error 12 368.85 30.74 

Plan 9B.5, EF = 0.88 
d. d(.05, 6, 12) = 2.58, M = 2.58(4.19) = 10.8 
95% SCI 
Intersection Yy,; may, D D—-M D;+M (L,U) 
j#i 

1 68.4 28.6 39.88 29.0 50.6 (0, 50.6) 
2 28.6 28.7 —-0.1 —10.9 10.77 ( — 10.9, 10.7) 
3 28.7 28.6 0.1 —10.7 10.9 (— 10.7, 10.9) 
4 40.4 28.6 11.8 1.0 pak (0, 22.6) 
5 44.8 28.6 16.2 5.4 27.0 (0, 27.0) 
6 85.2 28.6 56.6 45.8 67.4 (0, 67.4) 
7 32.6 28.6 4.0 — 6.8 14.8 ( — 6.8, 14.8) 


4 


Select intersection types 2, 3, and 7 with P(CS) = 0.95. 


t=5,r=4,k =4,6=5 
iA 3 


Yigk = M+ Ti+ V5 + Pm(y) + Cijm Pee PE et ee 
7; = variety effect, y; = replication effect, pj) = block within replication effect 
€ijm = experimental error, mean 0, variance re 


Source df SS MS 

Replication 6 67,357: olbaiZ 
Block (unadj.) 7 336.90 48.13 
Variety (adj.) 7 ~=1,245.99 178.00 
Error 33 goni9 — 10.23 


Null hypothesis test for variety: Fp = 17.4, F(.05, 7,35) = 2.29, Reject 
Variety means (72): 43.3, 54.4, 38.4, 51.8, 47.5, 49.4, 42.5, 51.0 
Standard error = 1.29 

1.85 

0.86 


Yijlm = H+ Ti + Bm + Piim) + Yum) + Cijlm 

T; = variety effect, 8,, = replication effect, pi) = row within replication effect 
Yi(m) = column within replication effect 

€ijm = experimental error, mean 0, variance a? 
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Source df SS MS 

Replication 3 a Fe ee ee 
Column (unadj.) 8 29.30 3.66 
Row (unadj.) 8 37.50 4.69 
Variety (adj.) 8 130.87 16.36 
Error 8 6.04 0.75 


Null hypothesis test for variety: Fp = 21.81, F'(.05, 8, 8) = 3.44, Reject 
Results below obtained from GLM in SAS: 
Variety means (fi, ): 59.1, 53.4, 55.1, 53.0, 55.2, 54.2, 54.4, 48.7, 56.9 
Standard error = 0.60 

b. 0.87 from GLM in SAS 

e050 


Chapter 11 
1. Block I, Units 1-8: (1), ab, ac, ad, bc, bd, cd, abcd 
Block II, Units 9-16: a,b, c, d, abc, abd, acd, bcd 


3. Confound ABC in Rep I, ABD in Rep II, ACD in Rep III, and BCD in Rep IV. 
Rep I, Block I: (1), d, ab, ac, bc, abd, acd, bcd 
Rep I, Block II: a, b,c, ad, bd, cd, abc, abcd 
Rep II, Block I: (1), c, ab, ad, bd, abc, acd, bcd 
Rep II, Block II: a, b, d, ac, bc, cd, abd, abcd 
Rep III, Block I: (1), 6, ac, ad, cd, abc, abd, bcd 
Rep III, Block II: a, c, d, ab, bc, bd, acd, abcd 
Rep IV, Block I: (1), a, bc, bd, cd, abc, abd, acd 
Rep IV, Block II: b, c, d, ab, ac, ad, bcd, abcd 


5. a.” BSAC ABC, CDE 
b. Yes, because another choice for defining contrasts will not leave main effects or two-factor 
interactions confounded. 
c. Use the plan in Table 11A.1 with ACE, ABEF, and ABCD as defining contrasts. No main 
effects or two-factor interactions are confounded. 


a Source df SS MS Fo 
Replication l i246. 41.28 
Blocks 2 56.31 28.16 
A (speed) 1 639.03 639.03 80.28" 
B (angle) l EE 28... - 416 1 Ae 
C (lubrication) 1 52:53 52,545 260 
D (alloy) L- 424,03 124.032 43:53" 
AB i —- 132.03 -13703> 1659° 
AC 1 3.78 3.78 0.48 
AD l 34.03 3403 4.28 
BC a3 253- --0.32 
BD 1 0.03 0.03 0.00 
CD 1 2628-2628... 3.30 
ABC | 20805 -203555 23:76 
ABD bo 1578 Yi378 22a 
ACD l 7.03 7.03 0.88 
BCD l 7.03 7.03 0.88 
Error 14 111.44 7.96 


“Reject null hypothesis if Fp > F(.05, 1, 14) = 4.60 
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Effect (estimate): 

A( — 8.94), B(— 1.19), C( — 2.56), D(3.94), AB(4.06), AC(0.69), AD( — 2.06), 

BC(— 0.56), BD( — 0.06), CD(1.81), ABC(— 5.06), AB D(4.69), AC D( — 0.94), 

BC D( — 0.94), standard error = 1.00 

Prepare a plot of speed x angle cell means for each level of lubrication and for each level of 
alloy to interpret the speed x angle interaction as levels of lubrication or levels of alloy 
change. Although the main effects of speed, lubrication, and alloy are significant, as is the 
speed x angle interaction, the modification of their effects by the significant three factor 
interactions must be examined. 


generalized interactions are AC and B? 
generalized interactions are AB2C and BC 
Choose (b) to avoid confounding a main effect. 


elle, 4,9, 416 

. 14,2, £3.16 

s de, Ls oy ae 

. bt6,4, T1158 
AZ, oe, 31, 16 
1164; 6,-97,'63,32 


ABEF 

IV 

A = BCD, ACDEF, BEF, B = ACD, BCDEF, AEF; C = ABD, DEF, ABCEF 
D = ABC, CEF, ABDEF; E = ABF, CDF, ABCDE; F = ABE, CDE, ABCDF 
AB = CD, EF, ABCDEF; AC’ = BD; ADEF, BCEF; AD = BC, ACEF, BDEF 
AE = BF, ACDF, BCDE; AF = BE, ACDE, BCDF; BC = AD, BDEF, ACEF 
BD = AC, BCEF, ADEF; BE = AF, ACED, BCDF; BF = AE, ACDF, BCDE 
CD = AB, EF, ABCDEF; CE = DF, ABCF, ABED; CF = DE, ABDF, ABCE 
DE = CF, ABCE, ABDF; DF = CE, ABDE, ABCF; EF = AB, CD, ABCDEF 


Yes. Suppose the 27,7 design with factors A, B,C, D, and F’ using 8 units with levels of D and 


E generated as D = AB, and E = AC. Fold over the design by adding a second 23? design with 
all signs reversed from the first design, including the J column so that J = F’, a sixth factor. Then 
D = ABF and E = ACF. Design generators have four letters for a Resolution IV design using 
16 units for a 2°? design. 


a. 


b. 


D, E,and BE since they are no longer confounded with other main effects or two-factor 
interactions 

Use I = ABC = CDE. All main effects are no longer confounded with significant two-factor 
interactions or other main effects. However, AD = BE and AE = BD, and potential two-factor 
interactions are grouped as pairs. 


4—] 
2iv 


> SS 


V 

E= — ABCD 

See Table 12.3 and change signs on alias 

No three-factor or higher interaction 

Effect (estimate): 

A(0.145), B(0.087.5), C(0.0375), C(0.0375), D( — 0.0375), E( — 0.47), AB(0.015), AC(0.095), 
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a. 


AD(0.03), AE( — 0.1525), BC( — 0.0675), BD(0.1625), BE( — 0.405), CD(0.0725), 
CE(0.135), DE( — 0.315) 

Choose E, BE, and DE as significant effects from normal scores plot. MSE = 0.03978 from 
analysis of variance. Effect standard error = 0.0997. 


Source df SS MS 

E 1 0.88360 0.88360 
BE 1 0.65610 0.65610 
DE 1 0.39690 0.39690 


Error 12 0.47738 0.03978 


The normal probability plot to evaluate this model reveals no unusual residuals from assumed 
normal distribution. 


a? = 6.3, 5 df, 

Bo i 50/25, By ste 7.5, Bo = 10.75, standard error = 0.89 

Ho: By = 0, to = 8.42; Ho: Bo = 0, to = 12.08; t(.025, 5) = 2.571, Reject both; linear effects 
significant 

Bis = 1.5, standard error = 0.89, to = 1.69, t(.025, 5) = 2.571, Do not reject; No significant 
interaction 

vp - 3, = = 0.25, standard error = 1.98, t) = — 0.13, t(.025, 5) = 2.571, Do not reject; No 
significant departure from linear surface 

(A, B), (25, 35), (40, 56.5), (55, 78), (70, 99.5), (85, 121), (100, 142.5) 


2” Design Axial 
1 x2 L3 L4 H ota x2 23 D4 
—-1 -1 -1 -1l —2 0 0 0 
+1 -1 -1 -1 +2 0 0 0 
—-1 +1 -1 -1 0 -2 0 0 
+1 +1 -1 -1 0 +2 0 0 
—-1 -1 41 -1 0 0 -2 0 
+1 -1 +1 -1 0 0 +2 0 
—-1 +1 +1 -! 0 0 0 -2 
+1 +1 41 -!1 0 0 G +2 


—-1 -1 -1 +1 

+1 —1 -1 +1 

—-1 +1 -1 41 Center* 

+1 +1 -1 +41 Ly Bening Wa 
—-1 -1 41 41 0 0 0 0 
+1 -1 41 +1 *7 replications 
—-1] +1 41 41 

+1 +1 41 41 


a. 9 = 431 4+4.042, + 64.7522 — 8.80x? — 44.5523 + 1.752122 


b. 


10. 


Source 


Regression 


Linear 


Quadratic 


Error 


Lack of fit 
Pure error 


hRWwAITIWN MN 


SS 
47,552 
33,669 
13,883 

2495 
1,964 
831 


MS 
9,510 
16,835 
4,628 
399 
655 
208 


Complete model: Fo = 45.7, F(.05, 5, 4) = 6.266, Reject 
Quadratic components: Fo = 22.3, F(.05, 3, 4) = 6.59, Reject 


Cc. 


Linear components: Fy = 80.9, F(.05, 2, 4) = 6.94, Reject 


Lack of fit: Fo = 3.2, F(.05, 3,4) = 6.59, Do not reject 
Quadratic model is significant with no significant lack of fit to quadratic 
Methionine: x; = 0.302, M = 0.986 


Carotene: a= 0.733, C = 75.91 


d. 9 = 455 — 44.572? — 8.7422 
Z, = — 0.725 — 0.0242, + 1.000z2 
Z2 = — 0.320 + 1.0002; + 0.02425 
From design specified in Table 13.4 for near rotatable and orthogonal blocking designs 


Block I 


—1 
+1 
— 1 
= ae 
— | 
+71 
—1 
+t 

0 

0 


Sulphur 


0.3 
0.1 
0.1 
0.2 
0.2 
0.1 


b. 2.02, 6 df 
Psuedocomponent equation: 
Y = 12.85%, + 3.00%. + 3.45 %3 + 39.7072 + 40.40%) 73 + 10.90 %o%3 
Actual component equation: 
Y = — 559.702, — 140.9522 — 58.70xr3 + 992.502, 22 + 1010.0021 23 + 272.502973 


c. 


Source 


Regression 


—] 
—1 
+1 
+4 
—] 
—1 
a 
+1 

0 

0 


Linear 


Quadratic 


Error 


—1 
—1 
—1 
—I 
+A 
ae 
+t 
I 

0 

0 


Asphalt 


0.2 
0.4 
0.2 
0.3 
0.2 
0.3 


df 


s, 
Z 
3 
6 


Block II 

1.682 0 
— 1.682 0 
0 1.682 

0 —1.682 

0 0 

0 0 

0 0 

0 0 


a = 1.789 for Equation (13.7) 


Sand 


0.5 
0.5 
0.7 
0.5 
0.6 
0.6 


SS 
491.62 
262.42 
229.20 

12.10 


MS 
96.52 
1zi2t 
76.40 
2.02 
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fer) 


Test for linear: Fp = 64.96, F(.05, 2,6) = 5.14, Reject 
Test for quadratic: Fo = 37.82, F(.05, 3, 6) = 4.76, Reject 


Yijk = Mtartprt+dxnt+Bjt+(aPijteipz tk =1,2,3,4 g=1,2,3 

a; = fixed plant population density effect, G; = fixed hybrid effect, (a8); = fixed popula- 
tion x hybrid interaction effect, p; = fixed block effect, d;, = whole plot random error with 
mean 0 and ¢?, e;;, = subplot random error with mean 0 and variance o? 


Source df SS MS 

Blocks 3 408.98 136.33 
Population (P) 3 6,429.39 2,143.13 
P linear ke §,658.23-,.5,638.23 
P quadratic ] 767.64 767.64 
P cubic l 3.51 3.51 
Error(1) 9 466.54 51.84 
Hybrid (A) 2 881.41 440.70 
HxP 6 207,51 34.58 
Hx P linear 2 72.74 36.37 
H x P quadratic 2 92.30 46.15 
Hx P cubic : 42.47 21.24 
Error(2) 24 595.70 24.82 


Mean Head Seed Weight (g) 
Plants per Meter of Row 


Hybrid 10 15 25 40 Hybrid Means 
TAM 680 j8e- Sa 48.4 15.5 25.8 
RS 671 R52 ° = 5.0 = 2A. TIA 30.4 
Tx 399 SL. 808 259 216 36.3 
x Tx 2536 | 
Population 47.07 36.1 228 17.4 
means 


Factor(standard error of mean): Population(2.1), Hybrid(1.2), Cells(2.5) 

(i)) 1.8, (ii) 2.9, (111) 4.11 

Interaction: Fo = 1.39, F(.05, 6,24) = 2.51, Do not reject 

Population: Fo = 41.34, F(.05, 3,9) = 3.86, Reject 

Hybrid: Fo = 17.76, F(.05, 2, 24) = 3.40, Reject 

Subplot: RE = 0.98(1.3) = 1.27 

Whole plot: RE = 0.88(0.62) = 0.55 

See analysis of variance table for polynomial sums of squares partitions. No interaction 
partitions are significant with F'(.05, 2,24) = 3.40 as a critical value. 

Population linear: Fo = 109.15, Population quadratic: Fo = 14.81, Population cubic: 
Fo = 0.07, F(.05, 1,9) = 5.12, Reject for linear and quadratic 


Vijkim = + 04+ Bj + (AB) + PEF YVI+ det Omt (28) im+ (G8) jm 

+ (AB): jm + ekim 7,j=1,2 k,l =1,2,3,4 m = 1,2,3 
a; = fixed A effect, 6; = fixed B effect, (a3);; = fixed A x B interaction effect, p, = fixed 
row effect, +, = fixed column effect, d,; = random whole plot error with mean 0 and variance 
o. 6,,,. = fixed C. effect, (a);,, = fixed A.x C 
interaction effect, (G@)jm = fixed Bx C interaction effect, (a(@)jjim = fixed Ax BxC 
interaction effect, ez), = random subplot error with mean 0 and variance o? 
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ab 


Source df | Expected Mean Square 
Rows 3 

Columns 3 

A 1 02+307 +240? 
B 1 024307 +246 
AB 1 03 +304 + 120 
Error(1) 6 o02+30% 

C 2 02+ 160? 

AC 2 02 +862, 

BC 2 diet 862. 

ABC 2 02+46?,. 
Error) 24g? 


Yigkl = + 4 + pp + dig t+ Bet Vit (BY) Kit (@B) ik+ (@Y) at (BY) skit e ize 


i7= 1,2 “k,l = 1,2,3 


a, = fixed phosphorus effect, p; = fixed block effect, d;;= random whole-plot error with 
mean 0 and variance e. Gy = fixed nitrogen effect, y; = fixed water effect, (G-y),; = fixed 
nitrogen x water interaction effect, (a); = fixed phosphorus x nitrogen interaction effect, 


(ay), = fixed phosphorus x water 


interaction 


effect, 


(aBy)ix1 = fixed phospho- 


rus X nitrogen x water interaction effect, e;;,; = random subplot error with mean 0 and vari- 


ance a? 
Source df SS MS 
Blocks ] 0.67 0.67 
Phosphorus (P) ] 1.25 L235 
Error(1) 1 27.56 27.56 
Nitrogen (JN) 2 $32,768.65- 1,384.32 
Water (W) a 791.84 © 375.92 
NW 4 242.08 60.52 
PN 2 i271 6.35 
PW a 0.81 0.40 
PNW 4 13.87 3.47 
Error(2) 16 101.13 6.32 
Phosphorus Nx W 
Water Nitrogen 0 245 Means 
16 0 8.4 12.6 10.5 
130 Be a — Sas 
260 31.199336.7 37.2 
22 0 7.6 8.6 8.1 
130 20.7. = 20.8 20.8 
260 28.6 #273 279 
28 0 7.6 8.4 8.0 
130 20.6 19.8 20.2 
260 eas Zork 222 
Phosphorus Phosphorus 
Water 0 245 Nitrogen 0 245 
16 Atlas. .4) 8 0 7.8 9.8 
22 18.9 18.9 130 255° 248 
28 too 474 260 29.22 29.00 


654 CHAPTER 15 


Chapter 15 


c. 
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Means(std err): PNW(1.8), NW(1.3), PW(1.0), PN(1.0) 
P means: 20.8, 21.2, standard error = 1.2 
N means: 8.8, 25.1, 29.1, standard error = 0.7 
W means: 27.4, 18.9, 16.8, standard error = 0.7 
(i) 1.7, (ii) 1.0, (iti) 1.0, (iv) 1.8 
PNW: Fo = 0.55, F(.05, 4, 16) = 3.01, Do not reject 
PW: Fo = 0.06, F(.05, 2, 16) = 3.63, Do not reject 
PN: Fo = 1.01, F(.05, 2, 16) = 3.63, Do not reject 
NW: Fo = 9.58, F(.05, 4, 16) = 3.01, Reject 
W: Fo = 59.48, F(.05, 2, 16) = 3.63, Reject 
N: Fy = 219.03, F'(.05, 2, 16) = 3.63, Reject 
P: Fy = 0.05, F(.05, 1, 1) = 161.4, Do not reject 
Source df MS Fo 


N linear (N;) I 2,464.43 389.92* 
N quadratic (Nq) l 304.22 48.13* 
W linear (W)) l 672.04 106.33* 
W quadratic (W,) l 79.80 12.63" 
NW, 1 154.38  24.43* 
NW, l 0.46 0.07 
NW 1 40.89 6.47" 
N,W, 1 46.35 733° 
PN, 1 7.04 1.11 
PN, 1 5.67 0.90 
PW, l 0.01 0.00 
PW, | 0.80 0.13 
PNW, 1 9.77 E53 
PNW, l 0.01 0.00 
PN,W, l 0.46 0.07 
PN,W, l 3.64 0.58 


*Reject null hypothesis since Fy > F(.05, 1, 16) = 4.49 


Use a quadratic regression model with N and W for interpretations. 


Between-subjects design: A completely randomized design with three diet treatments and four 
subjects randomly assigned to each diet. 
Within-subjects design: Three repeated measures on each subject at 15, 30, and 45 minutes 
Minutes 
Diet “715 930 45 

1 WY 350 SED 

2 220 20.0: 115 

3 260 “27.2 = 35.8 


Yije = Bt astdn+Bj+(aP)jteiz 1,9=1,2,3 k=1,2,3,4 

a; = fixed diet effect, 3; = fixed time effect, (a8);; = fixed diet x time effect 

dj, = normally distributed independent random experimental error for subjects within diets 
with mean 0 and variance «7 

ei, = normally distributed random experimental error on repeated measures with mean 0 and 
variance o? 

The e;;, are assumed to satisfy the Huynh—Feldt condition of equal variance of the difference 
between all time periods 
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Source df SS MS 

Diet (D) 2 1,020.67 510.33 
Error (1) 9 413.33 45.93 
Time (7) z 170.17 =85.08 
T linear 1 92.04 92.04 
T quadratic l 7.13 —78A3 
DxT 4 869.67 217.42 
D x T linear 2 631.08 315.54 
D x T quadratic Ps 238.58 119.29 
Error (2) 18 128.17 7.12 


Diet x Time interaction: Fo = 30.5, F(.05, 4, 18) = 2.93, Reject 
Time: Fo = 11.9, F(.05, 2, 18) = 3.55, Reject 

Diet: Fo = 11.1, F(.05, 2,9) = 4.26, Reject 

Diet means: 28.5, 17.8, 29.7, standard error = 1.96 

Time means: 22.3, 27.4, 26.3, standard error = 0.77 

Cell means: See means in part b., standard error = 1.33 

See sums of squares partitions in analysis of variance table 

D x T linear: Fo = 44.3, F(.05, 2, 18) = 3.55, Reject 

D x T quadratic: Fp = 16.8, F(.05, 2, 18) = 3.55, Reject 

T linear: Fo = 12.9, F(.05, 1, 18) = 4.41, Reject 

T quadratic: Fo = 11.0, F(.05, 1, 18) = 4.41, Reject 

W = 0.41, chi-square approximation = 7.12,2 d.f., P = .0285. 


Source df SS MS Fo 
Time (linear) l 92.04 S218 12.12" 
Diet x T (linear) 2 631.08 315.54 41.53% 
Error 9 68.40 7.60 

Time (quadratic) 1 78.13 78.13 11.76% 
Diet x T (quadratic) 2 238.58 119.29 17.96% 
Error 9 59.76 6.64 


“Reject null hypotheses 


Subjects are random blocks in a randomized complete blocks design and each subject has 
received all treatments in random order 

Yij = Mt + pjt ei; eee ie ae 

7; = fixed fabric effect, p; = random subject effect, e;; = random experimental error with 
mean 0 and variance o?, assumed to be independent and normally distributed. Also assume 
there are no carryover effects of treatments 


Treatment 
Contrast 1 Z 3 4 5 
(i) 1 vs. 2 1 yh. 0 0 0 
(ii) 2 vs. 4 0 l 0 —1 0 
(iii) 2 vs. 5 0 l 0 0 -1 
(iv) 3 vs 4 0 0 l —1 0 


Orthogonal contrast pairs are (i) and (iv) and (iii) and (Wy). Sum of coefficient crossproducts 
of other pairs not equal to zero 


Source df SS MS 

Subjects 4 17.94 449 
Treatments 4 316.28 79.07 
Error 16... 227.08..:14.19 


Test for treatments: Fp = 5.57, F(.05, 4, 16) = 3.01, Reject 
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Contrast SS Fo 
Upyvs.2 = 24681 1739 
2 vs. 4 69.48 4.90% 
245.5 2132 - 58 
3 vs. 4 20.45 1.44 


*Reject null hypothesis since Fo > 
F(.05, 1, 16) = 4.49. 


Between-subjects design: Randomized complete blocks design with four varieties randomized 
to plots in each of three blocks 
Within-subjects design: Four levels of water application on the four subplots spaced at equal 
distances along the length of the variety plot. 
Distance 
Variety 1 2 e + 

1 416.0 447.3 436.3 328.3. 

Z 570.7 513.7 524.7 468.4 

3 358.1 429.2 449.1 326.9 

4 455.4. “S631 536.0° °336;2 


Yijk = Ut at pet dxt+ Bjt+(oP)gtege 149=1,2,3,4 k= 1,2,3 

a; = fixed variety effect, p, = fixed block effect, 8; = fixed distance effect, (a);; = fixed 
variety x distance interaction effect, dj, = whole-plot random error with mean 0 and variance 
v5, €;;, = subplot random error with mean 0 and variance a2. The subplot errors are assumed 
to satisfy the Huynh—Feldt condition of equal variances for differences between all distances. 


Source df SS MS 

Blocks 2 27,000 13,500 
Varieties (V) 3 128,063 42,688 
Error (1) 6 41,902 6,984 
Distance (D) 3. 409,128 . 39,709 
D \inear 1 39,145 39,145 
D quadratic l 70,211 oa 
D cubic 1 3766. 3708 
VxD 9 44,837 4,982 
V x D linear 3 7,499 2,500 
V x D quadratic 3 36,473 12,158 
V x D cubic 3 864 288 
Error (2) OA... 190,735 1406 


Varieties: Fp = 6.11, F(.05, 3, 6) = 4.76, Reject 

Distance: Fy = 5.36, F'(.05, 3, 24) = 3.01, Reject 

V x D interaction: Fy = 0.67, F(.05, 9, 24) = 2.30, Do not reject 

Variety means: 407.0, 519.4, 390.8, 473.1, standard error = 24.1 

Distance means: 450.0, 488.3, 486.5, 365.4, standard error = 24.8 

Cell means: See table in part b., standard error = 49.7 

See sums of squares partitions in analysis of variance table. 

V x Diinear: Fo = 0.34, V x D quadratic: Fy = 1.64, V x D cubic: Fo = 0.04, 
F(.05, 3, 24) = 3.01. Do not reject any interaction null hypothesis 

D linear: Fp = 5.29, D quadratic: Fy = 10.29, D cubic: Fo = 0.51, 

F(.05, 1,24) = 4.26, Reject for linear and quadratic partitions 

W = 0.29, chi-square approximation = 5.82, 5 d.f., P = .32 

G-G epsilon = .5634, adjusted d.f.: (Distance, 1.69), (Variety x Distance, 5.07), (Error, 
13.57); Conclusions do not change 


Chapter 16 


d. 


CHAPTER 16 657 


Source df MS Fo. Fes 
Blocks Zz 55,834 

D linear 1 39,145 11.4 0149 
V x D linear 3 2,500 0.7 5718 
Error 6 3,346 

Blocks 2 1,044 

D quadratic 1 76,217 222137 .0035 
V x D quadratic 3 12,158 3:5 0916 
Error 6 3.516 

Blocks 2 2,138 

D cubic 1 3,766 L3 3052 
V x D cubic 3 288 0.11 9594 
Error 6 2,997 


Yes. Each treatment follows each of the other treatments one time, and each treatment occurs 
once in each period and once with each subject. 

Yijk = B+ 04 + bij + Vet Ta(i,k) + Ac(i,k—1) + € ik 

£0,024 4-7,3,4 32 3.2 

a; = fixed treatment sequence effect, b;; = random subject within sequence effect with mean 
0 and variance a Ye = fixed period effect, 74(;,) = fixed direct effect of the treatment in 
period k of sequence 7, A,(i,4-1) = fixed carryover effect of the treatment administered in 
period k — 1 of sequence i, e;;, = random error for the subplot in period k with mean 0 and 
variance a”. Experimental errors are assumed to be normally distributed and at least satisfy 
the Huynh-Feldt condition of equal variances of differences between all time periods. Also 
assume no carryover effects higher than first order. 


Source df SS MS 
Sequence 3 2,10 O92 
Subjects 4 4.71 1.18 
Periods 3 0.05 0.02 
Drug (adj.) 3 40425 3.08 
Carryover (adj.) 3 0.06 0.02 
Error 15 £99 60.13 


Drug: fo-=-62.15, F(.05,3,15) =3.29, Reject 
Carryover: Fo = 0.15, F(.05, 3, 15) = 3.29, Do not reject 


Least Squares Mean for Treatment 


A B c D 
Contrast 1.17 1.16 3.28 1.17 Cc Co 
A vs.B l — 1 0 0 0.01 0.19 
B vs. C 0 1 —] 0 — 2.12 0.19 
B vs. D 0 l 0 —] — 0.01 0.19 


A vs. B: tp = 0.05, Bvs. C: tp) = — 11.16, Bvs. D: tp = — 0.05, 
Bonferroni t: t(.025, 3, 15) = 2.69, Reject for B vs. C, Do not reject for A vs. B and B vs. D 


Yes. Every treatment follows all treatments including itself one time. 
Yes. Carryover and direct treatment effects are orthogonal in this design, whereas they are not 
orthogonal in the four-period design. 
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Source df SS MS 


Steer at: 20.95 6.98 
Period 4 ito 293 
Roughage 3 114.55 38.18 
Carryover 3 320° 1.06 
Error 6 12.50 2.08 


. Roughage: Fo = 18.36, F(.05, 3,6) = 4.76, Reject 


Carryover: Fo = 0.52, F(.05, 3, 6) 
s(ii; — Zi.) = 0.93, d(.05, 3, 6) = 


= 4.76, Do not Reject 
3.10, Reject if |Z; -— 7, | > D(3, .05) = 2.88 


95% SCI 
Treatment fi; fi; — f,; (L,U) 
Cottonseed 72.79 — 6.67 (-— 9.55, — 3.79) 
Bermuda 75.74 — 3.88 (— 6.76, — 1.00) 
Wheat 77.49 —2.113 (— 5.01, 075) 
Alfalfa 79.62 Control 


The dry matter digestion of cottonseed and Bermuda is less than alfalfa since SCI does not 
include 0. 


Yes. Each treatment precedes all other treatments three times. 
e =p +agtyjt+ Td(i,j) + A te5-1) + Cij ee Op hanes © 4, d, c= 1,2, 3,4 

i; = fixed sequence effect, 7; = fixed period effect, 74; ;) = fixed direct keyboard layout 
fiat effect in period 7 and sequence 7, A,(;,;-1) = fixed carryover effect of treatment 
administered i in period (j — 1) and sequence 2, 
e;; = random error with mean 0 and variance o?. Experimental errors are assumed to be 
normally distributed and to at least satisfy the Huynh-Feldt condition of equal variances 
between all time’ periods. Assume no carryover effects higher than first order. The normal 
distribution assumption may not be valid since the response variable 1 is the discrete number of 
errors committed by subjects. 


Source df SS MS 

Sequence 11 223. i441 
Period 3 0.40 0.13 
Layout (adj.) 3° ONS8.73 $2:91 
Carryover (adj.) 3 121 0.40 
Error a7 13.58 0.50 


Layout: Fo = 105.82, F(.05, 3,27) = 2.96, Reject 
Carryover: Fo = 0.80, F(.05, 3,27) = 2.96, Do not reject 


Vij = bi + B(zi; — ZF) + ec; 1=1,2,3 akc eo 

4; = alloy mean, z;; = weld diameter, G = coefficient for linear regression of weld strength, 
y, on weld diameter, x, e;; = random experimental error with mean 0 and variance o?. 
Assume normally distributed errors with homogeneous variance; @ is the same for all alloys; 
and alloy treatments do not affect weld diameter 


Source _— df SS MS 

Regression 1 2,838.92 2,838.92 
Alloys 2 2,005.97 1,002.99 
Error 14 716.16 51.15 
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Regression: Fy = 55.50, F(.05, 1, 14) = 4.50, Reject, # = 5.505 
Alloys: Fo = 19.61, F(.05, 2, 14) = 3.74, Reject 
Alloy ¥;, Std Err 
L673. 3.29 
2 78.04 3.25 
a 5088 . 2.92 


Average standard error of difference = 4.63 
Plot the following regression lines: 
Alloy 1:9 = 50.42 + 5.505(a — 15.83) 
Alloy 2: 7 = 88.75 + 5.505(a — 19.83) 
Alloy 3: 9 = 51.50 + 5.505(a — 18.00) 
E = 3.69 
MS(homogeneity) = 129.13, 2 d.f.; MSE = 38.16, 12 df. 
Fo = 3.38, F(.05, 2, 12) = 3.89, Do not reject 
coefficient(std error): 8, = 3.92(1.07), By = 7.35(0.96), 83 = 4.19(1.44) 


Yj = M+ +pj+ B@ii-T)t+exj t=1,2,3 j= 1,2,3,4 

7; = irrigation effect, p; = block effect, z;; = number of plants, 8 = coefficient for linear 
regression of yield, y, on number of plants, e;; = random experimental error with mean 0 and 
variance a”. 

Assume normally distributed errors with homogeneous variance; 8 is the same for all 
irrigations; and irrigation treatments do not affect number of plants 


Source df SS MS 
“Blocks =i? 150 0 
Regression ] 1.08 1.08 
Irrigation Z 152. 0.76 
Error 3 0.44 0.09 


Regression: Fp = 12.0, F(.05, 1,5) = 6.61, Reject, B = 0.063 
Irrigation: Fo = 8.44, F(.05, 2,5) = 5.79, Reject 
Irrigation Y,, Std Err 


291 0.15 
2 A253 0.15 
3 2.11 0.15 


Average standard error of difference = 0.21 
E278 


~ 


Ery = 1.825, Ezz = 0.50, 6 = 3.65 = missing value estimate 


Source df SS MS 
Blocks (unadj.) 3 1.95 

Regression (x) ] 6.66 6.66 
Irrigation (adj.) 2 1.38 0.69 
Error 5 124 0.25 


Irrigation: Fo = 2.76, F(.05,2,5) = 5.79, Do not reject 
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a-designs, 348 
construction, 349 
efficiency, 349 
Latinized row-column, 351 
Additivity 
factorials, 182, 205 
Latin squares, 286 
randomized complete blocks, 266 
Adjusted 
sum of squares, 316 
treatment mean, 556 
treatment total, 316, 344 
Alias, 394 
Analysis of covariance 
assumptions, 559 
adjusted treatment mean, 556, 568 
efficiency, 558 
linear model, 553, 560, 566 
standard errors, 558, 568 
Analysis of variance 
assumptions, 43, 47, 57, 123 
missing data, 60, 208, 291, 486 
series of experiments, 296 
subsamples, 159 
unequal subsample numbers, 164 
see also individual designs 
Associate classes, 324 


Balance, 312 
carryover effects, 531 
incomplete blocks, 312 
partial, 323 
Balanced incomplete block design, 312 
analysis of variance, 316 
linear model, 315 
randomization, 313 
standard errors, 318 


treatment mean, 318 
Baseline period, 538 
Between-subjects design, 493 
Blocking 

criteria, 10, 264, 310 

purpose, 9, 263, 310 

random, 275 

relative efficiency of, 25, 272, 286 

row—column, 276, 320, 342 

size, 354 

sum of squares for, 267, 316, 342 
Block by treatment interaction, 271 
Bonferroni, 96 
Box-Behnken designs, 435 


Canonical analysis, 445, 467 
Cell means model, 42, 181, 201, 211 
Central composite designs, 431 
Coefficient of variation, 19 
Comparative observational study, 4 
Comparisons. See Contrasts 
Comparisonwise error, 92 
Completely randomized design, 13 

analysis, 47, 56, 59, 62 

analysis of variance, 55, 62 

linear model, 42 

equal replication, 38 

randomization, 39 

unequal replication, 60 
Components of variance. See Variance 
components 
Compound symmetry, 496 
Confidence intervals 

contrasts, 76 

inference, 80, 95 

simultaneous, 96 

treatment mean, 59 
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variances, 153 Double blind trial, 28 
vs P-values, 79 Dunnett test, 104 
Confounding, 369 
analysis of variance, 371, 374 Effects 
complete, 369 additive, 182, 205 
defining contrast, 379, 383 carryover, 522 
general method, 378, 383 direct, 524 
partial, 372 factorial, 177 
standard errors, 376 fixed, 148, 182 
2” factorials, 369 interaction, 178 
3” factorials, 383 main, 178 
Contrasts, 74 random, 148 
orthogonal, 81 simple, 177 
pairwise, 107 treatment, 53 
planned, 74, 94, 115 Efficiency 
regression. See Orthogonal polynomials analysis of covariance, 558 
repeated measures, 500, 506 incomplete block designs, 325 
standard error, 75 relative, 25 
sum of squares, 76 Efficiency factor, 
unequal replication, 82, 110, 104, 106, a-designs, 349 
114 crossover designs, 531 
Control treatment, 6, 104 incomplete block designs, 326 
Correlation lattice designs, 341 
intraclass, 155 nested row-column designs, 345 
repeated measures, 495, 511 Error 
serial, 512 experimental, 4 
subplot, 472 Type I and II, 19, 92 
Covariance, 70, 495 Even—odds rule, 371, 397 
Covariate, 14 Expected mean squares 
Crossover designs, 520 completely randomized design, 56, 62, 70 
analysis of variance, 526 factorials, 186 
balance for carryover effects, 523 mixed models, 239, 241, 242 
data coding, 545 nested and crossed factors, 253 
efficiency, 531 nested factors, 246, 248, 251 
extra period, 534, 539 random models, 152. 157. 234, 237 
incomplete blocks, 535 rules, 255 
linear model, 524 series of experiments, 295 
standard errors, 548-9 split-block, 486 
switchback, 539 split-plot, 473 
two treatments, 536 subsamples, 162, 167 
Williams, 532 Experiment, 3 
Cyclic designs, 345 Experiment design, 12 
Experimental error, 4 
Data files, 41 variance of, 4, 17, 43 
Defining contrast Experimental unit, 4 
2” factorial, 379 vs. observational unit, 17 
3” factorial, 383 Experimentwise error, 92 
Degrees of freedom, 54 strong control, 93 
Satterthwaite approximation, 168 weak control, 93 
Design plans 
a-designs, 361 f-value, 142 
balanced incomplete block, 330 F Max test, 114 
cyclic designs, 360 F test, 57 
incomplete Latin squares, 332 approximate, 168, 236 
Latin squares, 307 Factor, 7, 176 
references, 313, 321, 324, 353, 416, 439 notation, 176 


Design resolution, 398 qualitative, 8, 177 


quantitative, 7, 177 
sparsity, 392 

Factorial treatment designs, 7, 175 
analysis of variance, 186, 202 
complete block designs, 289 
confounded, 369 
effects, 177 
fractional replication, 393 
incomplete blocks, 369, 383 
linear model 181, 201 
standard errors, 188, 204, 212, 217 
unequal replication, 208 

Fisher, R. A., 1 

Fractional factorials, 391 
analysis, 399 
complementary fraction, 396 
construction, 396, 407, 409 
defining relation, 395 
design generator, 394 
design resolution, 398 
fold-over technique, 413 
Plackett-Burman designs, 411 
principal fraction, 396 
randomization, 398 
saturated design, 411 
standard errors, 405 

Full model 
analysis of covariance, 554, 566 
balanced incomplete block design, 315 
completely randomized design, 43 
crossover design, 527 
factorials, 227 
missing data, 214, 291 
mixture designs, 455 
regression, 463 


Gradient treatment designs, 510 
Greenhouse-Geisser € adjustment, 505, 519 


Hierarchical design. See Nested factor 
Homogeneity of variances, 47 

F max test, 130 

Levene (med) test, 128 
Huynh-Feldt 

condition, 497-8 

€ adjustment, 505, 519 


Incomplete block designs, 310 
2” factorials, 369 
3” factorials, 383 
a-designs, 348 
balanced, 312 
balanced lattice, 338 
choosing, 352 
cyclic, 345 
crossover studies, 535 
efficiency, 325 
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Latinized, 351 
lattice square, 343 
partially balanced, 322 
randomization, 313 
rectangular lattice, 341 
resolvable, 339 
response surfaces, 436 
row—column, 320, 342 
Inference 
confidence interval, 80, 95 
confident direction, 95 
confident inequality, 95, 108, 114 
strength of, 95 
Interaction 
block by treatment, 271, 286 
factorials, 175, 178 
generalized, 377, 381, 386 
treatment by experiment, 296 
Interaction as error, 205 
Intrablock analysis, 319 
Interblock information, 320 


Latin rectangle design, 287 
Latin square design, 275 
analysis of variance, 282 
linear model, 281 
multiple, 287 
nonadditivity, 286 
randomization, 279 
relative efficiency, 286 
standard errors, 283-4 
Lattice designs, 341 
Lattice square designs, 343 
Least significant difference (LSD), 110 
Least squares, 47 
balanced incomplete blocks, 336 
completely randomized design, 47 
factorials, 225 
regression, 463 
Levene (med) test, 128 
Linear model, 45 
generalized, 140 
Local control, 8 
Logit, 134 


Matching, 11 
Mauchly test statistic, 498 
Missing data 
blocked designs, 291 
completely randomized design, 60 
factorials, 208 
random models, 157, 164 
split-plot design, 486 
Mixed model, 239 
expected mean squares, 241 
repeated measures, 498 
standard errors, 240 
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Mixture designs, 447 
analysis, 453 
augmented simplex-centroid, 450 
interpretations, 458 
pseudocomponents, 452 
simplex-centroid, 450 
simplex coordinate system, 448 
simplex-lattice, 449 

Model. See Linear model 

Multiple comparisons, 91 
with the best, 98 
with the control, 104 

Multiple range test, 110 


Nested and crossed factor designs, 251 
analysis of variance, 254 
expected mean squares, 253 
linear model, 252 

Nested factor designs, 243 
analysis of variance, 246 
expected mean squares, 246 
linear model 245 
staggered, 249 
standard errors, 247 

Nested row—column designs, 343 

Nonadditivity. See Additivity 

Noncentral F’, 63 

Noncentrality parameter, 63, 208 

Nonorthogonal, See orthogonality 

Normal equations 
completely randomized design, 47 
balanced incomplete blocks, 337 
factorials, 225, 229 
regression, 464 

Normal probability plot, 126 


Observational unit, 16 

Orthogonality 
blocks and treatments, 291, 315, 344 
contrasts, 81 
crossover designs, 526, 534 
factorials, 209 

Orthogonal polynomials, 85, 190 
standard errors, 89 

Outliers, 131 


Pairwise comparisons, 107 
Einot-Garbiel, 114 
Hayter, 116 
least significant difference (LSD), 110 
Student-Newman-Keuls, 112 
Tukey HSD, 108 
Partially balanced incomplete block design, 
S22 
Path of steepest ascent, 429, 432 
Placebo, 7, 27, 60 
Plackett-Burman designs, 411 


Power, 19, 63, 158 
Profile plot, 493 
Probit, 134 
Pseudoreplication, 18 
P-values, 58 

vs confidence intervals, 79 


Quantile, 126 
Quantile-quantile plot, 126 
Randomization, 20. See individual designs 
randomization tests, 21 
restricted, 24 
Randomized complete block design, 264 
analysis of variance, 268 
linear model, 266 
multiple plots per block, 271 
nonadditivity, 271 
random blocks, 275 
randomization, 265 
relative efficiency, 273 
standard errors, 269 
subsamples, 272 
Random model, 151, 234 
expected mean squares, 152, 234, 237 
Rectangular lattice designs, 341 
Latinized, 351 
Reduced model, See Full model 
Regression, 441, 463 
Relative efficiency. See individual designs 
defined, 25 
quick check, 274 
Repeated experiments, 292 
Repeated measures designs, 492 
analysis of variance, 500, 505, 508 
regression contrasts, 500, 506 
in space, 510 
standard errors, 502, 506 
Replication, 16 
unequal, See Missing data 
Replication number 
for factorials, 208, 255 
for F test, 63 
for variances, 158 
for z test, 18 
Research hypothesis, 5 
Residual, 124 
absolute, 128 
factorials, 190 
randomized complete blocks, 272 
standardized, 131 
studentized, 132 
Residual plot, 128 
normal probability, 127 
residual-fitted spread, 142 
spread-location, 129 
Residues mod m, 378 
Resolvable designs, 339 


analysis of variance, 342, 345 
Response surface 

analysis, 440 

canonical analysis, 445, 467 

exploration, 445 

first-order model, 425 

second-order model, 425 

sequential experiments, 425 

stationary point, 444, 466 
Response surface designs 

2” factorials, 426 

3” factorials, 431 

Box-Behnken, 435 

central composite, 431 

incomplete blocks, 436 

rotatable, 433 | 
Row-—column designs, 320, 342, 522 


Sample size. See Replication number 
Sampling. See Subsampling 
Satterthwaite approximation, 168 


Scheffe test, 97 
Screening studies, 115, 392 
Simultaneous inference, 94 
Split-block design, 483 
Split-plot designs, 469 
analysis of variance, 473, 480 
experimental errors, 472 
experimental units, 471 
linear model, 473 
randomization, 471 
relative efficiency, 481 
standard errors, 478, 484 
Staggered nested designs, 249 
Standard error of contrasts, 75 
Standard error of difference 
analysis of covariance, 558, 568 
crossover designs, 548 
Dunnett test, 104 
incomplete blocks, 318 
Latin squares, 284 
LSD test, 111 
mixed models, 240 
randomized complete blocks, 270 
split-plots, 478, 484 
Standard error of effects . 
2” factorials, 369 
fractional factorials, 405 
Standard error of the mean, 
analysis of covariance, 558 
completely randomized design, 59, 63 
factorials, 188, 204 
incomplete blocks, 318 
Latin squares, 283 
nested designs, 248 
randomized complete blocks, 269 


INDEX 665 


random models, 154 
with subsamples, 162 
unequal replication, 104, 110, 114, 212 
Strip-plot design. See Split-block design 
Student-Newman-Keuls (SNK) test, 112 
Subplot, 469 
Subsamples, 159 
complete block designs, 272 
expected mean squares, 162, 167 
optimum allocation, 163 
unequal number, 164 
Sum of squares, 


adjusted treatment, 316, 345, 527, 548, 


555 
experimental error, 47 
unadjusted block, 316 

Sum of squares partitions 
analysis of covariance, 555, 561, 567 
complete randomized design, 52 
contrasts, 76 
factorials, 185, 202 
incomplete blocks, 316, 325 
Latin square, 282 
orthogonal polynomials, 87 
randomized complete blocks, 268 
response surface analysis, 441 
series of experiments, 297 
subsamples, 161 

Switchback design, 539 


Taguchi, G., 413 
Technique, 8 
Transformations, 133 
arcsin, 134 
ladder of powers, 135 
log, 135 
power, 135 
square root, 133 
Treatment, 4 
Treatment design, 5 
Treatment effect estimates 
factorials, 226 
intrablock, 338 
row—column designs, 345 
Treatment mean, 43 


least squares estimate, 47, 182, 212, 317 


Tukey HSD test, 108 

2” factorials, 363 
effects, 365 
identity column, 368 
incomplete blocks, 369 
nomenclature, 364 
standard order, 367 

3” factorials, 383 


Uniformity trial, 11 
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Variance 
confidence interval, 153 
of a difference, 325, 531 
experimental error, 4, 17, 43 
homogeneity tests, 128 
of a mean, 25, 162, 247, 275 
Variance components, 151, 234 
analysis of variance estimates, 152, 234 
estimation, 258 
negative estimates, 155 
models, 151, 234, 245, 252 
Washout period, 522 
Weighted squares of means, 215 
Whole plot, 469 
Williams crossover designs, 532 
Within-subjects design, 493 


